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Frequency Sampling Filters— 
Hilbert Transformers and Resonators 


By R. E. BOGNER 
(Manuscript received November 6, 1968) 


We first briefly review the principles of frequency sampling filters. 
We also show that the “conventional” frequency sampling filter can be 
modified simply to give an output which is the Hilbert transform of the 
original output. Both the original and transformed outputs are made 
available by the use of the simple complex number resonator described. The 
relationship between this system and filtering by Fourier transforming 
ts shown. 


I. INTRODUCTION 


Frequency sampling filters are filters whose frequency responses are 
synthesized as the sum of elemental frequency responses of the form 
(Fig. 1a)? 


sin [rf — fi)/fol -jensr Ac A, sin rf + fi)/fe gr iertt (1) 


VAD Brgy rg © u(f + h/t 


where 


V;,(f) is the transfer function of the kth response; 

A; is a constant multiplier, the value of the amplitude response at 
frequency fx; 

f is frequency in hertz; 

f, is the kth sampling frequency = kf,; 
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(a) 





Fig. 1— (a) Elemental frequency response contribution; (b) Elemental time 
response contribution. 


f. is the frequency interval between samples, that is, f, = fr4i—fr, 
fo = 1/DT, D = delay in samples; 
7 is the group delay, a constant for all the responses. 


Because of the constant group delay, the amplitude versus frequency 
response, |V (f)|, of the sum is given by 


Vol = Ea eed — fle 4 sna + ie | as 


By choice of the A;, suitable amplitude responses for many applica- 
tions may be specified. These will be bandlimited functions of fre- 
quency. 

The elemental time responses, v;,(t) (Fig. 1b) are convenient to 
realize by digital methods. They are truncated cosine waves. 

Figure 2 shows a comb filter, whose impulses occur DT’ seconds 
apart, followed by a resonator, whose impulse response is a cosine 
wave of frequency an integral multiple of 1/DT. The overall impulse 
response is the sum of the cosine responses to the two impulses; this 
is zero before the positive impulse, a cosine from then until DT sec- 
onds later, and thereafter zero, when the two cosines cancel. 

A complete frequency sampling filter is shown in the left of Fig. 3. 
Usually the resonators have been programmed as conventional second 


COMB 
FILTER RESONATOR 





Fig. 2— Comb filter followed by cosine resonator. 
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Fig. 3 — Frequency sampling filter, followed by Hilbert transformer. 


order systems, with slight damping to ensure stability under condi- 
tions of error in the resonator coefficients. 


II. USE AS HILBERT TRANSFORMER 


A frequency sampling filter may be readily adapted to give an out- 
put which is the Hilbert transform of that of the filter described above. 
Consider the sampling filter (Fig. 3) followed by a Hilbert trans- 
former, h(t). This is equivalent to the system of Fig. 4, where the one 
Hilbert transformer has been replaced by one at the output of each 
elemental filter. Now, in the original frequency sampling filter, the 
kth resonator has an impulse response, for time sampled systems 


g.(nT) = cos w,(nT), n = 0, 1, 2, --- 


where 7 is the sampling interval. The Hilbert transformed version of 
this is approximately 


g.(nT) = sin w,(nT). 


The approximation is discussed in Appendix A. Thus to make a system 
equivalent to the original frequency sampling filter plus Hilbert trans- 
former, we need only replace the resonators by ones with impulse re- 
sponses sin w,t. This could be done by use of modified second order delay 
resonators; but the system of Fig. 5 is more convenient programwise 
and is helpful conceptually. This system has the z transform system 
function 


W@ _ steep Ne oo 
U(z) GO = 1 — 2° exp [(a + )T] (3) 


and corresponding impulse response 


g(nT) = pee n= 0, ee (4) 
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Fig. 4— Frequency sampling filter with separate Hilbert transformers. 


For a = 0, the real and imaginary parts are cos onZ’ and sin wonT. A 
small negative value of « would be used for stability. 

The frequency sampling filter then has the form of Fig. 4, with each 
channel containing one complex number resonator instead of the res- 
onator plus Hilbert transformer. The output at each sampling time is 
a complex number, whose real part corresponds to the output of a 
conventional frequency sampling filter, and whose imaginary part is 
an approximation to the Hilbert transform of the real part. 

In Appendix A, the analysis of the approximation results in the 
following observations: 


(1) The Hilbert transformer cannot handle signals with frequencies 
tending to zero. 

(iz) For signals with low-frequency components, care is necessary 
in specifying the frequency samples to ensure that the negative-ire- 
quency tail of the positive-frequency response component is of small 
amplitude. 

(wt) The errors are in the amplitude and not phase characteristics. 


The system is capable of filtering a complex input, wu + jv without 
modification of the resonators. 


III. RELATION TO DISCRETE FOURIER TRANSFORM 


Consider a = 0. The response of the kth resonator at time nT, 


n = 0, 1, 2, --- , to a unit pulse at time mT is exp [ju,(n — m)T]. 
Hence the response at time nT to a signal s(mT’), m = --- , —1,0, 
1, 2, --- is: 


n 


> s(mT) exp [jo.(n — m)T] 


MaF—-O 


exp (jo,.nT) o> s(mT) exp (—jo,mT). (5) 


x,(nT) + jy.(nT) 


I 


{I 
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w(t) =x (t) tjy (t) 
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Fig. 5— Complex number resonator. 


When the comb filter precedes the resonator, the effect of its nega- 
tive impulse, occurring D7’ seconds after the positive impulse is to 
add the second term of (6): 


BOT) Eyal) <x Gant) > Sn T exp (=D) 


— exp (jwnT) >> s(m — D)T exp (—jo,mT) 


n 


= exp Ginnt)| S> s(mT) exp (—jo,mT) 


m=—0 


n-D 
— >> s(mT) exp (—jw,mT) exp (— jox pr) |. (6) 
But DT is an integral multiple of the period 27r/w, as mentioned in 
Section I; thus exp (—jw,DT) = 1. Hence 


n 


a,(nT) + jy,.(nT) = exp (janT) >> s(mT) exp (—jo,mT’). (7) 
=n 1 

This expression may be recognized as an oscillation exp (jw,n7') whose 

coefficient is the value at frequency w, of the Discrete Fourier Transform 

(DFT) of s(mT), computed over the last D samples. The output of the 

frequency sampling filter, taking into account the weights A, , is 


unl) + jy’) = D A,[z.(nT) + jy(nP)] 


DX exp (jw,nT) A, >» _S(mP) exp (— jw, mT). (8) 


=n—-D 
This is the Fourier synthesis (inverse DFT) of the frequency function 


n 


A, > s(mT) exp (—jo,mT), k= Lo 2s sey (9) 


m=n—-D+1 


which may be regarded as the product of the running DFT of s(mT) 
and a DFT whose values at frequencies w, are the A, . 
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Frequency sampling filtering is thus equivalent to filtering by Fou- 
rier transforming, multiplying by a filter frequency function, and in- 
verse transforming. 

The filter frequency function (A,;, k = 1, 2,...) has, so far, been 
considered real. There is no reason why the A; should not be com- 
plex, permitting the filter to have an arbitrary phase characteristic. 
The complex values of the A; may be specified in cartesian or polar 
form, the latter being more convenient for amplitude-phase specifica- 
tion, 

Another way of looking at the resonator output is obtained by re- 
arranging (7): 


0 


a(n) + jy,(nT) = >> _s{(m + nT exp (—jw,mT). (10) 


m=—(D—-1 
This may be recognized as the DFT of the last D values of s(mT), 
shifted in time so that the latest occurs at time m7’ = 0. 


IV. CONCLUSION 


The use of complex number resonators in a frequency sampling 
filter provides a Hilbert transformed output as well as the conven- 
tional filtered output. The system can readily accept a complex time 
function as input, and has a very simple flow chart. The output is 
equivalent to that obtained by the use of Fourier transforms to per- 
form filtering in the frequency domain. 

A sampling filter subroutine using the ideas presented has been 
written in Fortran IV. It has been used for filtering and Hilbert 
transforming speech signals in a number of tasks. 


Vv. ACKNOWLEDGMENT 
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APPENDIX A 


Errors in the Hilbert Transformer 


A cosine wave, truncated in time, is the basis of the frequency sam- 
pling filters. A correspondingly truncated sine wave has been used 
as an approximation to the Hilbert transform of the cosine. The 
errors in this approximation will be analyzed by comparing the 
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Fourier transform of the truncated sine wave with that of the true 
Hilbert transform of the cosine. The analysis is for continuous (that 
is, nonsampled) sines and cosines. 

The truncated cosine response is taken to be 


Q2rNt 
eo 


T 
h(t) = co p 75 


IA 
pd ae | 


t 


IIA 


= 0, elsewhere. 
The F transform of h,(t) is 


sin If — x sin (PF oh xy 


T 
HQ) = =|—>- + + Ss (11) 
a3) a) 
= H,,(f) + H..(f), respectively. (12) 


H,(f) may be separated further into main responses and “tails” 
(Fig. 6): 


H.(f) = Aas (f) os H4-(f) ae H.24(f) =F FH .2(f) (13) 


where 


Ha.=Ha, $>0; “82, ;=0; 0, f<o 


Ha.=0, f>0; 82, f=0; Ha, f<0 
Ho. -Ha, {>0; 2%, ¢=0; 0, ¢ <0 


Hy =0, {>0; 222, f= 0; Hay J <0. 


The F transform of the Hilbert transform [h,(t)] of h.(é) is then 


H(f) = —jsgn (HCP) (14) 
= —jHas(f) + jHa-(f) — jH2+(f) + jo). (15) 


The truncated sine response is taken to be 


h,(t) => sin Agee ’ ~ 2 t = 


A 


QrNt T T 
2 


= 0, elsewhere. 
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Fig. 6 — Components of elemental frequency response. 


The F transform of h,(t) is 


a2 rT I ~ v) : ie T(t + N 
ali -3) “al ¥) 


which by comparison with (11), (12), (13) is seen to be 


H,(f) —jH.,(f) + 3H .2(f) 
JH iG) > jhaaG) + jh) + jh acQ): (17) 
Then from (15) and (17): 


H,(f) = H.(f) — 2jHa-(f) + 27H 224 (f). (18) 


The error in approximating H,(f) by H,(f) is thus attributable to the 
tails H..-(f) and H..,(f), which are small for N > 1. From the defini- 
tions (11), (12), (18), it follows that these tails are related: 


H.,-(—f) = Hea+(f). (19) 


In a complete frequency sampling filter, the transforms corresponding 
to all the time responses are to be added. Errors in the “Hilbert trans- 
formed” output, y, as compared with the straight filtered output, 2, 
are determined by the resultant tails; these tails may be of small 
amplitude if suitable values are chosen for the frequency samples. 
Just what criterion of smallness should be applied depends on the 
application. Some general observations may be made, however: 


H,(f) = (16) 


wl 


I! 


I 
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(t) The Hilbert transformer cannot be useful to zero frequency 
because a zero frequency sample has tails equal to the main responses, 
and would thus contribute gross errors. This is of course consistent 
with the infinite duration of the impulse response (1/t) of a true 
Hilbert transformer. 

(zt) To transform signals with low frequency components, many . 
frequency samples may be required to provide the sharp and con- 
tinued cutoff required for tail suppression. 

(wt) Since Hy—(—f) = H.2i(f), it follows from (18) that the 
errors, associated with H,,_(—f) and H,24(f) are directly in or out 
of phase with the relevant main responses. The error in the Hilbert 
transform is thus an amplitude and not a phase error. This result is 
also consistent with the observation that the approximate Hilbert 
transformed response to an impulse is truly odd. 


APPENDIX B 


Relationship between Complex Number Resonator and Conventional 
Second Order Resonator 


While the formal transform relation between (3) and (4) is readily 
shown, it is satisfying to explain how the seemingly first order delay 
system can produce an oscillatory response. The system of Fig. 5 is 
described by the equation 


(mT) + jy(mT) = w(mT) +e"! xm — 1)T + jy(m—1)T) (20) 


When a pulse u(0) = 1, with zero before and after is applied, the first 
response 1s 


x(0) + jy) = 1+ 90 
The next response is simply the first response multiplied by e 
x(iT) + jy(QT) = e**? "(1 + 30); 


there is a similar multiplication at each subsequent sampling instant, 
yielding the impulse response 


a(nT) + jy(nT) = ht??? n=0,1,2-5°, (21) 


(atiw)T 


equivalent to (4). 

The complex number resonator may be shown to contain a second 
order delay feedback, making its oscillatory response consistent with 
that of the more conventional second-order systems. Its equation (20) 
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Fig. 7 — Expanded flow chart for complex number resonator. 
may be examined by equating separately real and imaginary parts: 
a(mT) = u(mT) + *? cos wf)x[(m — IT] 
—(¢*"sineD)y[((m — HT] (22) 


y(mT) = (e*7 sin wT)2[(m — 1)T] + (e*7 cos wT)y[(m — 1)T] (23) 


Equations (22) and (23) may be represented by the flow chart of 
Fig. 7. There is, in fact, a path of delay two sampling intervals from 
the real output x, via y, the imaginary part of the output, back to 
x. Thus, y could be considered to provide the necessary memory for 
the second delay. 

One aesthetically pleasing feature of the representation (Fig. 7) is 
the symmetry. If a complex input, wu + jv were to be filtered, then v 
would be found to be applied to the lower summer. 
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Scattering from Dielectric Mirrors 


By D. GLOGE, E. L. CHINNOCK, and H. E. EARL 


(Manuscript received September 9, 1968) 


Most of the light scattered from high-reflectivity dielectric mirrors 1s 
radiated into directions close to the reflected beam. We measured the angular 
power distribution at angles between 0.01° and 1° from the beam axis by 
scanning with a narrow slit. From this a linear structure function 1s 
calculated for coherence lengths between 20 microns and 1 millimeter, 
assuming isotropic surface statistics. The corresponding power density 
decreases with the third power of the scattering angle. The power outside 
a given radius and the power density is plotted for various wavelengths 
and distances. 


I. INTRODUCTION 


The improvement of dielectric mirrors during recent years has re- 
duced their surface scattering considerably. Nevertheless, there are 
applications which are limited by these small amounts of scattered 
light. One of them is the laser gyroscope whose locking threshold 
depends on the light scattered back into the direction of incidence. 
Measurements have been performed recently to analyze this case.* 

Another application is the simultaneous transmission of many laser 
beams in an optical waveguide for communication purposes.? The 
focusers in such a guide will probably be front surface mirrors rather 
than lenses because, for the large apertures needed, lenses are apt to 
have imperfections in the bulk. Dielectric mirrors have fewer im- 
perfections, but they still scatter some light into adjacent beams where 
it produces crosstalk. It was the purpose of our experiment to measure 
some representative mirror surfaces as a basis for later feasibility 
studies on multiple beam waveguides. Only the light in a narrow cone 
around the beam is collected by the next focuser and eventually con- 
tributes to the crosstalk. The experiment showed that, in this cone, 
the scattered light intensity decreases relatively fast with increasing 
angle. 
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Applying these results to more complicated problems requires a 
simple but adequate mathematical representation of the results. We 
found that the standard scattering theory, which uses a covariance 
function to describe the mirror surface statistics, serves this purpose 
very poorly? On the other hand, a simple structure function can be 
found which is a satisfactory representation of the physical reality in 
the range of the measurements and is easily applicable to practical 
problems. 


II. SCANNING THE SCATTERED POWER DISTRIBUTION 


The measurements were performed with a 50-cm He-Ne laser gen- 
erating a 1-milliwatt gaussian beam at 6328A. To achieve enough 
sensitivity and discrimination against noise, the laser beam was 
chopped for signal processing in a lock-in amplifier as shown in Fig. 1. 

A slit was used to scan the scattered light. This requires scanning 
only in one direction (while the slit averages over the perpendicular 
coordinate) and more signal power is collected than with a pinhole 
method. Because of its circular symmetry, the scattered power den- 
sity can be calculated from this measurement by a simple integral 
transformation. 

To avoid scattering from dust particles in the beam path to and 
from the mirror, this path was evacuated to about 4 torr. But care- 
ful comparison with measurements in unfiltered, though quiet, air 
showed no measureable difference. 

The mirror had a radius of curvature of 24 m and a diameter of 
15 cm. The beam, having a 1/e-width of 24 mm at the mirror, was 
focused to 0.8 mm in the plane of the slit. The slit was 0.15 mm 
wide. Figure 2 shows the relative intensities normalized to the peak 


CHOPPER 
WHEEL WINDOWS [< aaa oes 24m —------------- 4 
1 


AMPLIFIER 





t I 
HORIZONTAL SCANNING EVACUATED 24M RADIUS MIRROR 
FOR SLIT AND MULTIPLIER TO 4 TORR TO BE TESTED 


Fig. 1— Setup to measure the scattering under vacuum. 
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Fig. 2—Scattered power at 24-m distance normalized to the peak power 
(the vertical lines are part of the coherent beam profile). 


intensity in dB and plotted versus the vertical coordinate. In this 
logarithmic plot, the gaussian intensity profile of the coherent signal 
has a parabolic shape, part of which is represented by the almost 
vertical lines in the center of Fig. 2. If diffraction and spherical 
aberrations are taken into consideration, the fall-off is not quite as 
sharp as indicated by the parabola, but these effects were estimated 
to be well below the light levels measured. Therefore, we believe that 
surface scattering is the sole source for our results. The scanning 
range in Fig. 2 corresponds to angles from 0.01 to 0.1 degree. 

For larger angles up to 1 degree, a 1-m set-up was used which was 
basically similar to the one shown in Fig. 1, but had no vacuum en- 
closure. The five mirrors tested in this arrangement had a radius of 
curvature of 1 m, a diameter of 25 mm, and the same coating as the 
24-m mirror. The test beam in this set-up was 4 mm wide at the mirror 
and was focused to 0.2 mm at the slit. The slit had a width of 0.05 
mm. Coatings from different batches showed up to 3 dB difference. 
Figure 3 shows average and variation of the results. Again the in- 
tensity normalized to the peak intensity is plotted in dB. The profile 
of the coherent beam is shown in the center. 
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2.5 5.0 7.5 10.0 12.5 15.0 17.5 20.0 22.5 25.0 
xz’ IN mm 


Fig. 3—-Scattered power at 1l-m distance normalized to the peak power (the 
vertical lines are part of the coherent beam profile). 


Both the 24-m mirror and the 1-m mirror were polished and 
coated by the same methods though by different manufacturers. They 
were tested to be spherical within 4/10. The reflection loss of the 24-m 
mirror was measured by a multiple reflection technique to be 0.135%. 
The mirrors were measured new without previous use, but no increase 
of the scattering was measured by repeated checks during the follow- 
ing weeks. Further lifetime studies are under way. 


III. DESCRIPTION OF THE SCATTERING SURFACE 


The scattering plotted in Figs. 2 and 3 originates from a slight 
roughness or ripple structure 6(X, Y) on the mirror surfaces which 
is, of course, different for different mirrors. The surfaces tested in this 
experiment, however, were manufactured by the same process and 
are therefore equivalent in a statistical sense. That implies that the 
average magnitude of each ripple component, that is, the “power spec- 
trum” of 6(X, Y), is the same from mirror to mirror. The average 
has to be taken over an ensemble of test surfaces; however, for cor- 
relation lengths small compared to the test area, the ensemble average 
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may be replaced by an average over the individual surface. In this 
case, measuring only one or a few surfaces still yields a meaningful 
result, though only for correlation lengths small compared to the 
radius w of the light beam at the mirror surface. 

The “power spectrum” is closely related to the scattering profile 
measured by the slit method. A vertical slit at X’, as in Fig. 4, col- 
lects mainly light scattered from the vertical ripple component with 
the spatial frequency 

XxX? 


t= Ty (1) 


where . is the light wavelength and L the distance between slit plane 
and mirror. Therefore, apart from a constant, the scattered profile 
s(X’) of Figs. 2 and 3 agrees with the “power spectrum” d,(x) of 
8(X, Y) for Y = constant.* The quantitative relation between d, and 
s 1s given in (38) of Appendix A and reads 


3 

da) = a erf (2 a) (2) 
where ¢ is the slit width and w the 1/e-width of the gaussian light 
beam at the mirror surface. A log-log plot of d,(xz) is shown in Fig. 
5. The points on the left hand side are taken from Fig. 2 and rep- 
resent the 24-m experiment, the ones on the right hand side stem 
from Fig. 3 and the 1-m experiment. Since the mirrors are statistically 
equivalent, all these points belong to the same function. A rough ap- 
proximation is attempted by the straight line in Fig. 5 which represents 
the function 


D 
d,(x) = af (3) 
with 
D = 6-10" mm. (4) 


The Fourier transform of d, is the covariance of 8(X, Y) along 
lines Y = constant. It proves impossible, however, to perform this 
transform without knowing d, for very small x where it increases 
rapidly. Accurate information about this range is unnecessary if the 


* Strictly speaking, s(x) is a two-fold convolution of dz(x) with the intensity 
profile of the beam and the slit aperture function; but because the latter two 
functions are very narrow as compared to the scattered profile, the above sim- 
plification is appropriate. 
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MIRROR 








Fig. 4—Sketch of the experiment showing the coordinate system used. 


structure function 
A(X, = Xs ? Y,—- Y2) = ((6(X, ’ Y;) = 5(X2 ; Yay jis (5) 


is used instead. The interrelation between A and d, is derived in the 
Appendix A and given in (39). It involves the transformation 


/ : d,(x) sin” (rX2) dx. 


The sin?-kernel of this transformation reduces the contribution from 

the zero-end of the d,-function and A(X, 0) can therefore be calculated 

more accurately in the range of interest than the covariance. 
Inserting (8) into (39) yields the functional approximation 


ee ee. % ( 32n' )| 
A(X, 0) = Oa? E we + In\{1l — 2 DX ’ (6) 
where X has the meaning of a correlation length. In the range X < w, 
which is shown in Fig. 6, the structure function (6) is essentially a 
straight line given by 


A(X, 0) = 4n°DX. (7) 


This result suggests that the mean square difference between sam- 
ples of 8 increases proportionally to the distance at which they are 
taken. At the right side of Fig. 6 the quantity (A) can be read off, 
which indicates a direct measure of the heights of the surface ir- 
regularities as a function of their extension about the surface. This 
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Fig. 5— Power spectrum of the mirror surface roughness. The line represents 
a best-fit approximation to the measured points. 
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Fig. 6—The structure function A calculated from the approximated power 
spectrum. The broken line indicates the possible uncertainty of the result. 
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quantity does not, at least not in the measured interval, approach a 
definite rms value, but increases unlimited for larger and large sam- 
pling distances. This is not necessarily in contrast to physical reality, 
but one has to keep in mind that, for macroscopic sampling distances, 
the statistics of 8 are probably governed by a different process, which 
could mean a steeper rise as well as a leveling off for the structure 
function. 

Unknown contributions from outside the measured interval of d, 
will to some degree affect the accuracy with which (7) can be evaluated. 
Ruliog out any poles of d, for x ~ 0 (which would mean nonstatistical 
components), a ‘‘worst case” may be established by assuming that (3) 
holds only in the measured interval 7, < « < x. , everywhere else 
d.(z) = 0. Then from (89) withz <<wandA<1 


Ze 2 2 
A(X, 0) = sp | EAS ay (8) 


with xz; = 0.8mm and x2 = 30mm”. The dashed line in Fig. 6 shows 
the evaluation of this integral. The accuracy seems satisfactory for 
coherence lengths between 20u and 1 mm. 

Though A(X, 0) describes the statistics of § only along lines Y = 
constant, this result can easily be generalized assuming that the mir- 
ror surface is isotropic. Then the structure function has circular 
symmetry and can be expressed as a function of the radius R = 
(X + Y?)*%. This function reads 


A(R) = A(R, 0) (9) 
where A(R, 0) is given by (7). 


IV. THE DISTRIBUTION OF THE SCATTERED POWER 


For most applications the actual scattered light distribution around 
the beam is of more immediate interest than the structure function. 
Of course, this light distribution not only depends on the properties 
of the mirror, but also on the properties of the light beam reflected 
off the mirror. More specifically, this light distribution is the convolu- 
tion of the intensity profile of the primary beam with the “power 
spectrum” of the mirror irregularities. Only when the beam profile 
is very narrow, as in our experiment, do the scattered light distribu- 
tion and the “power spectrum” become proportional functions. 

In this section we evaluate this distribution for various optical 
wavelengths in arbitrary cross sections of the beam. If applied to 
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problems where the width of the beam may not be neglected, the 
convolution of this function with the intensity profile of the beam has 
to be formed. 

Because of the isotropy of the mirror surface, the scattering has 
circular symmetry. If we define a normalized radius r = (2? + y?)*, 
the scattered light distribution p(r) can be calculated from (32) of 
Appendix A. By substituting x by r in (32), one obtains 


2t f° _pryr dr_ 
LN J, (r? — 2°)?’ 
where r is related to the radius R’ = (X” + Y’)*” by the normaliza- 
tion 


S(a) = (10) 


R’ 
~ DN aD 
One can solve (10) for p(r) by multiplying both sides by «dz/ 
(v? — r?)*% and integrating with respect to x from r to o.* After inter- 
changing the order of integration on the right-hand side, the integral 
over x can be evaluated and one obtains 


r 


EX f° 2S@) de _ { eo ; 
ot ; (a = r°)? = : 2rrp(r) dr. (12) 
The integral on the right represents the total power scattered out- 


side a circle with radius r and will be called P(r) in the following. 
Insertion of (2), (27), and (35) into (12) yields 


Sr f° ad,(x) dx . 
tot )2- ; (a? — r*)*? 
and the power density p(r) is finally obtained from the differentia- 
tion 


P(r) =P (13) 


a (14) 
By using (8) for d, in (18), one obtains for the power outside the 
radius r 


4r* D 
Pr) = aay a (15) 


and the power density 


2r° D ; 
pe) = Fr =a Pro - (16) 
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To gain information about the power density as a function of the 
scattering direction, one may multiply (11) by A and find the scat- 
tering angle 


, 
pawae. (17) 
The power scattered into directions deviating by more than p from 


the beam axis is obtained by inserting (17) into (15) 


P(#) = PP... (18) 


The derivative with respect to p yields the angular power density 


1 dP 2° D 
P,(p) = op o P ect (19) 


Equations (18) and (19) are evaluated for various wavelengths in 
Figs. 7 and 8. Figure 7 shows the power fraction outside p which de- 
creases linearly with increasing radius. Fig. 8 shows the power frac- 
tion radiated into a given solid angle at p. This function decreases 
with the third power of p. 

Finally, for a certain distance L, one finds the power arriving out- 
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Fig. 7— The total power fraction scattered with an angle larger than p off 
the beam axis for various wavelengths X. 
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Fig. 8 — The angular power density as a function of the scattering angle p 
for various wavelengths X. 


side a circle of radius R’ to be 


R\ _ 45 DL 
P(e) = 47 XR’ tot (20) 


and the derivative with respect to R’ yields 


iy: 1 dP = 2 DL 
Dr (R’) = 57 R? aR’ ~ 2a iR”™ Piss (21) 





where pp, is the scattered power density at a distance L. Equation (20) 
is plotted for various Z in Fig. 9. Figure 10 shows the power density 
versus the radius which decreases with the third power of R’. It is 
interesting that the power density at a fixed radius increases pro- 
portional to the distance from the scatterer. 

Of course, equations (15) through (21) hold only for r < 0.8 mm”™, 
the lower limit of the interval measured, and are based on the assump- 
tion that (8) is valid for r > 30 mm™*. However, as d, is small in the 
latter region, a possible error introduced by this assumption should not 
be significant in the range of interest. 


Vv. CONCLUSIONS 


The small angle scattering was measured for very highly reflecting 
dielectric mirrors. A reasonable functional approximation for the 
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Fig. 9— The total power fraction scattered outside a circle with radius R’ at 
various distances L. 


measurements leads to a linear structure function for coherence 
lengths between 20 microns and 1 mm. The rms difference between 
surface deviations found at two points 1 mm apart is 30 Angstroms 
and decreases with the root of the distance for points closer together. 
At an angle of 0.1° the scattered power density per cm? is 10-* of the 
total power. It decreases proportional to the third power of the angle 
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_ Fig. 10 — The scattered power density, as a function of the radius R’ at various 
distances L. 
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and linearly with the light wavelength. No considerable differences 
were found for mirrors polished and coated by two different manu- 
facturers which were using the same processes and chemicals. 
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APPENDIX 


The Structure Function 


Consider a coherent Gaussian beam of wavelength . to be reflected 
off the mirror in Fig. 4 and focused on a plane with a slit. Assume 
that unperturbed phase fronts emanating from the mirror were 
spherical with a field distribution 


E(X, Y) = E, exp [—(X’ + Y*)/w’]. (22) 


The mirror diameter may be considered sufficiently large compared 
to the 1/e-width w so that the field at the mirror edge may be 
neglected. In this case, the field in the focal plane is 


f(x, y) = / ii e eM H(X, Y) dX dy - (23) 
where 
x? sf 
t= Ty and Y= (24) 


are the normalized coordinates in the focal plane (see Fig. 4). The 
solution of (23) is 


f(x, y) = Eyw*n exp [—2'w*(2* + y’)). (25) 
The total power 
2 
Prot = rm (26) 


can be calculated by integrating (22) or (25). 

Assume that the slit in the focal plane is long enough to collect all 
the power in y-direction and has a width ¢ in x-direction. Then for 
X’ = 0 the signal received is 





S(O) = Pros ert (2 ) (27) 
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where erf denotes the error function. This is the peak signal measured 
with no scattering present. However, for the case in question where 
the scattered power is less than a percent, (27) can also be used for 
the peak center signal of the scattering profile. 

The scattering under consideration originates from a slight rough- 
ness 6(X, Y) of the mirror surface which gives rise to a phase varia- 
tion 


o(X, ¥) = = ax, ¥) (28) 


on the otherwise perfect phase front. The term 8(X, Y) is assumed 
to be a gaussian random process with isotropic statistics. Its structure 
function is given by (5). 

It can be shown that for gaussian statistics® 


(exp t[o(Xo , Yi) — o(X2 , Yo)]) = exp | -2 a|- (29) 


The power density in the focal plane can be calculated from (23) by 
introducing the phase factor exp[—ie(X, Y)] and then multiplying 
(23) by its conjugate complex. Using (29) yields finally 


ple, y) = fll BX, , Y)E% , ¥:) 


2 
rm | -8¢ A 3 OS v.) | 


-exp [—727(X, — X,)z] 
“exp [—727r(Y, — Y.)y] dx dX» adyY, ay, . (80) 


After a standard coordinate transformation, this becomes 
p(t, y) = Pro ff exp (-(X? + ¥9)/2u") 


“exp | -85 A(X, v) | exp [—22r(Xx2 + Yy)]dX dY. (81) 


To measure the relatively flat power distribution outside the co- 
herent beam, one may average over the slit width ¢ and consequently 
the signal measured in this region is approximately 


se) = i [pew ay. (32) 
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Because of (81), this becomes 


+00 1 xP 8 2 sigs 
S(z) = Fe Pun OP | -i% = sz A(X, 0) fe ea (88) 
For eee convenience the normalized signal 
_ S(a) 
a(0) = Sp) (34) 


was measured and plotted in Figs. 2 and 3. From (27) and (33) one 
finds 


co = flay) 


ae 1x? Sr" —i2eXz 
a exp | ~3 Poe a(x, 0) fe dX. (85) 


Inverting the Fourier transformation in (35) yields 








LX 2 
exp | -2 42 — 8 ac, 0) | 


“ i (0 rwt ) i ianXz 
= erf 5 ED s(x)e ae: (36) 


The evaluation of this integral is problematic for small x where the 
measurements are impeded by the coherent beam, but where s(x) is 
large and contributes significantly to the de component of A(X, 0). 
To overcome this difficulty, the identity A(0, 0) = O can be used 
which is based on the definition (5) of the structure function. Incor- 
porating this identity (36) can be rewritten in the form 


2 2 
1-— —— — SF ax, 0) | 


ot (Fy) {eed 


The function 





die) = JAS erf (= ve (a) 


may be interpreted as the ee spectrum” of 6(X, Y) for Y = 
constant. Consequently, x has the meaning of a spatial frequency 
related to the Fourier components of § along lines Y = constant. d, 


(38) 
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as well as s are even functions of x. Therefore, the solution for A(X, 0) 
can be written in the form 


ING ae a *, 216 E eee / da) ee GXD ix}. 


NM Jo 
(39) 
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Apparent Increase in Noise Level 
When Television Pictures 
Are Frame-Repeated 


By F. W. MOUNTS and D. E. PEARSON 
(Manuscript received October 16, 1968) 


Subjective measurements were made of the apparent increase in noise 
level which occurs when television pictures are frame-repeated. We show 
that in all cases of practical interest this increase is small (less than 3 dB), 
that 2t is dependent on the type of scanning (greater increases with line- 
sequential than with line-interlaced scanning), and that tt is relatively 
independent of the picture signal-to-noise ratio. At smaller numbers of 
repetitions—the region which shows most promise for practical schemes of 
bandwidth saving—the increase in apparent noise level with increased 
frame-repetition ts most rapid. 


I. INTRODUCTION 


Seemingly attractive schemes for compressing the bandwidth of tele- 
vision signals sometimes render the signal highly sensitive to noise. 
As a result, the signal-to-noise ratio requirement for the channel be- 
comes extremely large.t If this requirement is not met, the errors 
caused by the noise degrade the picture to an intolerable degree. Thus, 
noise is a great obstacle to the success of bandwidth-saving schemes, 
and the authors of any such schemes should always take care to check 
the noise-sensitivity of their compressed signals. 

We report in this paper on some measurements we have made of 
the noise-vulnerability of picture signals in a frame-repeated tele- 
vision system. In a previous paper concerned with the possibilities of 
bandwidth-saving by frame-repetition or frame-replenishment, Brain- 
ard, Mounts, and Prasada made the observation that with increasing 
numbers of repeated television frames there appears to be an increase 
in the picture noise level.? The noise pattern is “frozen” for the repeti- 
tion period; this tends to make it more visible to the eye. Any source 
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noise, such as that created by delta modulation or pulse code modula- 
tion (PCM), or any channel noise, such as random gaussian noise, has 
its effective power raised by the process of frame-repetition. We at- 
tempted to measure this apparent increase in noise power in quantita- 
tive terms to determine what improvements in modulation methods or 
channel noise levels would be required if television pictures were 
frame-repeated. 


II. EXPERIMENTAL EQUIPMENT 


The equipment used to produce the frame-repeated pictures has 
been fully described in Ref. 3. For the experiments reported in this 
paper random noise was added to the video signal prior to frame- 
repetition. An automatically-timed switch was arranged to present, 
alternately, on a single display monitor, frame-repeated and non- 
frame-repeated (standard) versions of the same picture. Controlled 
amounts of noise could be added to the frame-repeated picture by 
the experimenter and to the standard comparison picture by the sub- 
ject. This arrangement permitted the subject to carry out a visual 
match of the levels of noise in the frame-repeated and standard pic- 
tures. A block diagram is given in Fig. 4; a more detailed descrip- 
tion of the apparatus is given in Appendix A. 


Ill. EXPERIMENTAL METHOD 


The method we used to measure the apparent increase in noise 
level in a frame-repeated picture was to ask 24 subjects to view in 
succession frame-repeated and standard versions of the same picture. 
With controlled amounts of noise added to the frame-repeated picture 
by the experimenter, the subjects were required to adjust the noise 
level in the standard picture until the noise level in the two pictures 
appeared to be the same. The difference in the actual or measured 
noise levels was taken as the apparent increase in noise level. Judg- 
ments were obtained for various numbers of repeated frames, for 
both interlaced and sequentially-scanned pictures, and for several 
values of signal-to-noise ratio. The study was restricted to band- 
limited white gaussian noise. A single still picture (Fig. 1) was used 
in all the trials as a background against which the noise was viewed. 
Details of the test conditions and subject instructions are given in 
Appendix B. 
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Fig. 1— Test picture used in the experiment. 


IV. RESULTS 


Averaged results for the 24 subjects are shown in graphical form in 
Figs. 2 and 3. In both sets of graphs the average apparent increase in 
noise level is plotted as a function of the repetition ratio. With se- 
quential scanning (Fig. 2) the time taken to scan a complete frame. 
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Fig. 2— Sequential scanning: (a) 30 dB signal-to-noise ratio; (b) 40 dB signal- 
to-noise ratio; (c) 50 dB signal-to-noise ratio; (d) combined results. 


was 1/60 second, whereas with interlaced scanning (Fig. 3) it was 
1/30 second. This fact often leads to semantic confusion when com- 
paring the two cases; what we have termed 2:1 frame-repetition of 
an interlaced picture is sometimes loosely referred to as 4:1 frame- 
repetition by virtue of the fact that the time period during which the 
picture is repeated is the same as that for 4:1 sequentially-scanned 
pictures. To emphasize our usage we have indicated the repetition 
period in seconds along the horizontal axis in all graphs. 

Separate curves are plotted for each of the three signal-to-noise 
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ratios used in both sequential- and interlaced-scan conditions. These 
signal-to-noise ratios are the true or measured ratios and refer to 
the frame-repeated picture, not the standard picture used for a com- 
parison; thus, at higher numbers of repetitions the apparent signal- 
to-noise ratios are less than the stated figures by an amount equal to 
the ordinate of the curve. No attempt has been made to fit a smooth 
curve to the measured points as there was no way of knowing what 
type of curve to fit. Instead, the points have been connected by 
straight-line sections. 

Several subjects experienced difficulty in adjusting for subjective 
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Fig. 3— Interlaced scanning: (a) 33 dB signal-to-noise ratio; (b) 43 dB signal- 
to-noise ratio; (c) 53 dB signal-to-noise ratio; (d) combined results. 
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equality when the signal-to-noise ratio was 50 dB in the sequential 
case and 53 dB in the interlaced case. At these levels the noise is very 
near the threshold of visibility, and even allowing for the increase in 
noisiness with frame-repetition, the noise is barely perceptible. Fur- 
thermore, at these signal-to-noise ratios small alterations in noise 
level are much less noticeable than at 30 dB or 40 dB signal-to-noise 
ratios. The subjects discovered that for a 50 dB signal-to-noise ratio 
they could rotate their attenuator control through a number of steps 
without affecting the relative appearance of the two pictures. Two 
subjects maintained that altering the attenuator over its full range 
made no perceptible difference to the picture at 50 dB, and in con- 
sequence, when presented with a comparison at this noise level, merely 
reiterated their setting for the previous presentation. The variability 
of the 50 dB settings, as compared with the 30 dB and 40 dB settings, 
reflects these difficulties. In Figs. 2a, b, and c, and 3a, b, and c the 
standard deviations o of the plotted means are shown. These were 
calculated according to the formula 


=(4 Yo x).0) | 


where the X; are the 24 results whose mean (X),, is plotted in the 
graph. The vertical lines about each plotted point extend to -to. 
Figure 2d is a superimposition of Figs. 2a, b, and ¢ for comparison 
purposes. Similarly Fig. 3d is a superimposition of Figs. 3a, b, and e. 


V. DISCUSSION 


A point of interest about the results is the obvious difference be- 
tween the graphs for interlaced and sequentially-scanned pictures. 
The apparent increases in noise level are substantially larger in the 
sequential case (Fig. 2) than in the interlaced case (Fig. 3). For ex- 
ample, consider four presentations of each picture. From Fig. 2d 
the average increase in noise level for sequential scanning is about 
3 dB, while from Fig. 3d the increase is seen to be a little over 1 
dB. If this comparison is deemed to be unfair because the period of 
repetition is not the same in each case, then consider the 4:1 frame- 
repeated sequential case against the 2:1 frame-repeated interlaced 
case. Again, the difference is substantial. 

A partial explanation of this difference may be given in terms of 
the time period of repetition. The scanning of a single frame takes 
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twice as long in the interlaced case as it does in the sequential case. 
Thus, in standard or non-frame-repeated interlaced pictures every 
noise sample or element on the screen is seen for twice as long (1/30 
second) before replenishment by a different noise sample as it is in 
sequentially-scanned pictures (1/60 second).* With a 2:1 frame- 
repeated sequentially-scanned picture, noise samples are replenished 
by different samples every other frame, that is, also at 1/30 second 
intervals. Therefore, crudely speaking, a standard interlaced picture 
is a 2:1 frame-repeated sequentially-scanned picture with a different 
order of line presentation. Hence, from Fig. 2 we would expect about 
1.5 dB difference in the apparent noise level between standard inter- 
laced and standard sequential pictures having the same added noise 
pattern, with the sequential picture having the higher apparent signal- 
to-noise ratio.t We have observed with our system that, by taking 
a picture with a fixed amount of added noise and changing the read- 
out method from sequential to interlaced with no frame-repetition, 
there appears indeed to be a slight increase in noise level.t This ob- 
servation is only that of the authors, however, and we have yet to 
confirm it under properly-controlled conditions with a larger sample 
of subjects. Until this experiment is performed it should not be as- 
sumed, from Fig. 2 and 3, that a 4:1 frame-repeated sequential pic- 
ture looks noisier than a 2:1 frame-repeated interlaced picture, each 
having the same signal-to-noise ratio. 

If an ordinary interlaced picture can be equated to a 2:1 frame- 
repeated sequential picture, it follows that the upper portion of Fig. 
2d, above a horizontal line drawn through 1.5 dB, should correspond 
to Fig. 8d. It can be seen that this correspondence is by no means 
exact, although in the case of the 30 dB signal-to-noise ratio curve it 
is quite close. The 50 dB curves are unreliable because of the pre- 
viously-mentioned difficulties of adjustment at low noise levels, so 
that the 40 dB curve represents the main discrepancy and obstacle to 
accepting the correspondence. In Fig. 2d the 40 dB curve closely 
follows that of the 30 dB curve (an analysis of variance showed no 
significant difference between the plotted points) while in Fig. 3d 
the 43 dB curve diverges from the 33 dB curve and dips down to zero 
at higher repetitions. We carefully examined this phenomenon and 

* The decay time of the phosphor may be considerably less than 1/30 second, 
but the sample is seen for a longer period owing to the persistence of vision. 

+ Having the same added noise pattern implies a 3 dB difference in their 
signal-to-noise ratios (see Appendix A). The interlaced picture has the higher 


actual S/N. 
+ Roughly estimated at between 1 and 3 dB. 
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conclude that it may well be explicable in terms of an interference 
or masking effect due to the interline flicker. At high levels of noise 
the masking is inoperative, but at lower levels it becomes predominant 
and acts to reduce the apparent noisiness. It also has a greater effect 
on the large-grain slow-moving noise produced by the higher repeti- 
tions. These conclusions are derived from personal observation and 
are tentative, but they do explain the shape of the Fig. 3 curves to 
some extent. A further point which should be clearly borne in mind 
in evaluating the curves is that the difference limen for random noise 
is probably at least 1 dB at 30 and 40 dB signal-to-noise ratios and 
greater at 50 dB signal-to-noise ratio; the increases in apparent noise 
level and variations in the apparent increase which are being con- 
sidered are therefore quite small and, to the average person, fre- 
quently indiscernible. 

Another point of interest in the graphs (more evident in Fig. 2 than 
Fig. 3) is the rapid rise in apparent noise level at small numbers of 
repetitions followed by a general flattening out or saturation above 
1/15-1/10 second (66-100 milliseconds). This corresponds roughly 
to the integration period or critical duration of the eye.* Below the 
critical duration, the eye sums “frozen” noise frames and sees increas- 
ing granularity with increasing frame-repetition. Above the critical 
duration the granularity stays constant, but the apparent spatial 
movement of the noise becomes slightly more noticeable with larger 
numbers of repetitions. It is unfortunate that in the region which 
shows most promise from the point of view of useful band-compres- 
sion without noticeable picture deterioration (2:1, 3:1, or 4:1 se- 
quential and 2:1 interlaced) the increase in noise level is most rapid. 

Notice finally, that subjects exhibited a slight bias in preference in 
the experiment toward the standard comparison picture. This can be 
seen in Figs. 2 and 3 by the positive intercept in apparent noise level 
increase at the 1:1 repetition ratio in all the graphs. This bias may 
have been due to slight differences in the brightness and contrast of 
the frame-repeated and standard pictures, as well as to difficulties in 
measuring signal-to-noise ratio to an accuracy of less than 1 dB. 


VI. CONCLUSIONS 


The apparent increase in noise level due to frame-repeating is 
fairly small: between 1 and 3 dB in the range of repetition ratios 
which are likely to be of practical interest for bandwidth-savings (up 
to 4:1 with sequential-scanning, 2:1 with interlacing). 
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With sequentially-scanned pictures the increase in apparent noise 
level is greater than with interlaced pictures, but interlaced pictures 
appear to be noisier to start with. Without further experimentation 
it is not possible to say with certainty whether a frame-repeated 
sequential picture is noisier than a frame-repeated interlaced picture, 
when each has the same signal-to-noise ratio. 

The rate of increase in apparent noise level is greatest in the region 
which shows most promise from the point of view of useful bandwidth- 
saving without picture deterioration (up to 4:1 frame-repetition with 
sequential scanning, 2:1 interlaced). 

The apparent increase in noise level is largely independent of 
signal-to-noise ratio, with one possible exception: low-level noise in 
interlaced pictures appears to be masked by interline flicker at the 
higher numbers of repetitions. 
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APPENDIX A 


Details of the Experimental Apparatus 


A block diagram of the experimental equipment is shown in Fig. 4. 
A 60 frame per second, 160-line, sequentially-scanned video signal 
was derived from a vidicon camera. To this signal random gaussian 
noise at one of two levels was added, dependent on the setting of 
switch S,. In position A, corresponding to frame-repetition (see linked 
switch S2), the noise level was completely and solely under the control 
of the experimenter (attenuator J), and in practice was set such that 
the signal-to-noise ratio (measured as the peak-white to black-level 
signal voltage divided by the rms noise voltage) was, in the case of 
line-sequential scanning, either 30, 40 or 50 dB at the display moni- 
tors. For line-interlaced scanning the levels of added noise were not 
changed, giving values of signal-to-noise ratios 3 dB higher at the 
display monitors, that is, 33, 43 and 53 dB. This was because the 
different manner of readout from the frame store with interlacing 
effectively reduced both signal and noise bandwidths by a factor of 
2. In the B position the noise level was determined in part:by the 
experimenter (attenuator II) and in part by the subject. The experi- 
menter would, in practice, set attenuator II to 8 dB less than attenua- 
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Fig. 4— Block diagram of the experimental equipment. 
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tor I, that is, to give a 22, 32 or 42 dB signal-to-noise ratio (sequential 
scanning). The subject’s attenuator was adjustable in steps of 1 dB 
over a range of 10 dB, so that, given, for example, a 32 dB setting on 
the experimenter’s attenuator II and 40 dB on attenuator I, the sub- 
ject could vary the signal-to-noise ratio over the range 32-42 dB. 
This proved to be an adequate adjustment for the subject to match 
A and B noise levels in almost all cases. 

Subsequent to the addition of noise, the video signal was passed 
through a low-pass filter of bandwidth 768 kHz to the frame-repeat- 
ing equipment.* This equipment, consisting of a PCM encoder, 
memory, decoder, and control logic, has been fully described in Ref. 
3. With switch S, in position B no frame-repetition occurred and the 
picture seen on the subject’s and the experimenter’s monitor screens 
was a standard 60 frame per second picture with added noise. With 
the switch in position A, however, every nth frame was stored in the 
memory and read out to the display monitors n times in succession. 
The experimenter was able to select any n in the range 1-11. By 
means of switch S, (independently controlled and not linked to any 
of the other switches) the experimenter could choose to display the 
contents of the memory in either line-sequential or line-interlaced 
fashion. In both cases readout from the memory was line-sequential, 
the interlacing being produced by subsequently blanking every alter- 
nate line of the readout (the blanking signal was delayed by one line 
period in alternate readout scans to give the interlacing effect). The 
subjective effect of this type of interlacing is exactly equivalent to 
conventional interlacing when a still picture is used. For example, 
consider a single stored frame of a plain white picture to which noise 
has been added. In both the conventional interlaced readout and the 
sequential alternate-line readout, the noise patterns as seen on the 
display monitor will be identical. Viewers will not appreciate that, in 
the alternate-line blanking case, the lines are scanned at twice the 
rate with pauses in between lines. With conventional interlaced read- 
out the noise bandwidth and the noise power are halved, and the peak- 
signal to rms noise ratio increased by 3 dB. A corresponding 3 dB 
increase in signal-to-noise ratio has, therefore, been assumed for the 
alternate-line blanking method used in this instance. 

In both interlaced and sequential cases the memory readout was 
displayed on a sequentially-scanned monitor. Linked to S3, but not 
shown, was an arrangement of attenuators in the video path to the 


* The characteristics of this filter are fully described in Ref. 3. 


538 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


display monitors such that the screen brightness and contrast was 
unchanged in switching from sequential to interlaced pictures. This 
ensured that interlaced and sequentially-scanned pictures were seen 
under exactly similar conditions. 

Linked switches S,, Sz, and S; were controlled by an automatic 
timer which repeatedly switched between A and B conditions, each 
condition being presented for 214 seconds. To avoid visible transients 
these switches were arranged to operate only during the frame fly- 
back interval. Switch S; controlled two lights labeled A and B, visible 
to the subject, in order to indicate to him which presentation he was 
currently viewing. Linked switches S, and S, enabled the experimenter 
to set up a condition on his own monitor before presentation to the 
subject. 


APPENDIX B 


Test Conditions 


The viewing distance for each subject was approximately 25 inches, 
the picture size being 5 inches X 5 inches. Screen highlight and low- 
light luminances were maintained at 60 foot-lamberts (206 cd/m?) 
and 3 foot-lamberts (10 cd/m?) respectively. Ambient illuminance 
was approximately 5 foot-candles (54 lm/m?). 

Noise level matching was carried out by the method of adjustment.® 
Subjects were introduced to the method in the following way. On ar- 
rival for their test, an A-B pair was presented to them (the A and B 
presentations occurring successively on the same screen for 21% sec- 
onds each) in which the B presentation was noticeably noisier than 
the A. It was then demonstrated that by adjustment of the step at- 
tenuator (see Fig. 4) it was possible to lower the noise level in B 
until it matched the noise level in A. Subjects were invited to try the 
matching for themselves, and in all cases, with very little practice, 
succeeded in mastering the technique. It was explained that a num- 
ber of similar pairs would be presented to them, and that for each 
pair they were required to adjust the attenuator until the noise levels 
in the A and B pictures were the same. An unlimited time was allowed 
for the adjustment, most subjects taking about one minute, with a few 
taking as much as two minutes. When the pictures were matched to 
their satisfaction the subjects reported the attenuator setting to the 
experimenter. The quantity: Attenuator I setting — (Subject’s at- 
tenuator setting + attenuator II setting) was taken as the apparent 
increase in noise level due to frame-repeating. 
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All of the presentations were made with the same still picture, a 
portrait of a girl (Fig. 1). Had a moving picture been used, the sub- 
ject’s judgment would have been confounded by the motion break-up 
with frame-repetition. By using a still picture, the only visible dif- 
ference between the two pictures was in respect to noise level. 

Based on the results of preliminary experiments, repetition ratios 
of 1:1, 2:1, 4:1, 7:1, and 10:1 (60, 30, 15, 8.6, and 6 new frames per 
second respectively) were chosen to cover the range of interest for 
sequentially-scanned pictures. For interlaced pictures 1:1, 2:1, 3.5:1, 
and 5:1 (30, 15, 8.6 and 6 new frames per second respectively) were 
used. The 1:1 repetition here was identical to a standard or non- 
repeated picture, and was included as a check on the validity and ac- 
curacy of the A-B comparisons. 

The 27 A-B presentations, consisting of all combinations of the 3 
signal-to-noise ratios, the two methods of scanning and the various 
frame-repetition ratios (5 for sequential, 4 for interlaced scanning) 
were presented to subjects in random order, the order being different 
for each subject. Subjects made only one match per A-B pair. Twenty- 
four subjects were tested and the mean increases in apparent noise 
level, together with the standard deviation between subjects, were 
calculated for each of the 27 conditions. 
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The Generation and Accumulation of 
Timing Noise in PCM Systems— 
An Experimental and 


Theoretical Study 


By J. M. MANLEY 
(Manuscript received September 30, 1967) 


Three sources of timing noise in a self-timed regenerative PCM re- 
peater, namely, tank circuit mistuning, amplitude to phase conversion, 
and pulse shape, were studied both experimentally and theoretically. We 
discuss how these noises accumulate and combine along a chain of repeaters. 

The theoretical work ts from the viewpoint of frequency analysis which 
leads easily to the spectrum of the timing noise. We first give a simple form 
of this theory applicable in a number of cases, and then a more general 
form useful in other cases, which shows the approximations and limitations 
of the simple theory. 

We found that the spectrum of timing notse caused by tank circuit mis- 
tuning has no energy at zero frequency and because of this fact, timing notse 
from this source does not build up indefinitely along a chain of repeaters 
but soon reaches a limit. On the other hand, the spectrum of timing noise 
caused by amplitude to phase conversion does have energy at zero frequency; 
thus, timing noise from this source increases indefinitely along a repeater 
chain. Some of the timing noise ts attributable to pulse shape alone and in 
some cases may include a very low frequency part. This latter comes about 
through the small energy near the harmonics of the pulse rate in the tuned 
circuit response and the aliasing of this energy down to very low frequencies 
by the sampling process used in measuring the phase deviation or in 
generating the retuming pulses. 


I. INTRODUCTION AND SUMMARY 


A considerable amount of work has been done and results published 
on the subject of timing noise in pulse code modulation (PCM) sys- 
tems. The material in this paper comes from work which began as 
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an experimental investigation. This led to some successful simple 
theories and generalizations along somewhat different lines from 
those followed previously. 

It appears to be impossible for a regenerative repeater to perfectly 
restore a train of signal pulses to its original form because of the 
difficulty of obtaining a perfect timing source at a repeater location 
remote from the transmitter. The most widely used simple method of 
obtaining a timing wave is to pass the incoming pulse train, or some 
modification of it, through a narrowband resonant tank circuit, tuned 
as nearly as possible to the pulse rate. Since the tuning of the tank is 
unlikely to coincide with the pulse rate, since the bandwidth of this 
selective circuit is finite, and for other reasons, the derived timing 
wave is not perfect. Through this imperfect timing source, a certain 
amount of timing noise is added at each repeater to that already 
present in the incoming signal train. 

Because this noise arises from imperfections in the system, it may 
be considered to be analogous to the modulation interference noise 
in amplitude systems caused by small departures from linearity in 
various components. Thus, if the narrowband tank circuit could be 
centered exactly on the pulse rate and kept there, and if the pulse 
generating circuits were always triggered exactly at a zero crossing 
of the timing wave, and if nonlinearity were not required to generate 
the pulse rate, then major sources of timing noise at a regenerative 
repeater would not exist. 

As pointed out by W. R. Bennett and others, the principal effects 
of timing noise are two: 


(1) At any one repeater, the phase of the timing wave may be dis- 
placed in an irregular way from the proper place for optimum gating 
of signal pulses so that, at best, the tolerance of the system to noise 
is reduced and, at worst, errors in recognition of pulses or spaces are 
made. 

(it) Even if the sequence of pulses and spaces arrives at the receiver 
with no errors, the decoded signal samples will be irregularly spaced, 
thus introducing into the signal circuits a distortion which has the 
frequency of the deviation. The seriousness of this effect depends on 
its magnitude and the character of the signal. This effect is analyzed 
by W. R. Bennett in Ref. 1. 


A program of measurements for studying the properties of this tim- 
ing noise, and how it accumulates along a chain of regenerative 
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repeaters, was begun because of the difficulties which had been en- 
countered in trying to calculate the noise. It was planned to isolate 
the sources of timing noise and so consider each one separately and 
then in combination. The work described here is not concerned with 
the effects of the noise on the system. 

Measurements were not made on chains of varying numbers of real 
repeaters. Instead the chain is simulated by one real special repeater 
and a multitrack tape recorder as indicated by Fig. 1. While the 
previous repeater output is being reproduced from two tracks of the 
recorder and used as input for the real repeater, the new output is 
being recorded on two other tracks. One of each pair of tracks is used 
for the pulse train and the other for timing information. Because no 
recorder has steady enough speed, the timing wave cannot be recorded 
directly. Instead, the phase deviations are detected and these are 
recorded. During playback, the timing wave is reconstructed with a 
phase modulator. The recording is sufficiently long so that statistical 
fluctuations are well smoothed. For most of the work the pulse train 
consisted of random unipolar pulses at a 1 kHz rate. 

The first results obtained were on the noise caused by mistuning 
of the timing tank in one repeater and then two in tandem. Study of 
these results led to the development of a simple theory for the gen- 
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Fig. 1— Simulation of regenerative repeater chain. 
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eration of the timing noise and its accumulation along a chain of 
repeaters. Subsequent work demonstrated that this theory may be 
used to calculate the noise satisfactorily, not only for longer chains, 
but for amplitude to phase conversion sources of timing noise as well. 
Very good agreement was obtained between the noise calculated from 
this theory and that measured. 

A brief summary of these results obtained when narrow rectangular 
pulses are used follows: 

The spectra of timing noise at each of the repeaters in a chain of 
six, all mistuned alike, are shown in Fig. 2. This noise is designated 
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Fig. 2—Spectra of timing noise caused by 0.1 percent mistuning of timing 
tank, Q = 100. Random rectangular pulses, 10 percent duty factor. N = number 
of like repeaters in chain. 
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Type A; the most important characteristic of these spectra is that 
there is no energy at zero frequency. Because of this, the peak of the 
spectrum at the end of a chain never becomes larger than four times 
the peak at the first repeater, no matter how long the chain is. The 
root mean square (rms) value of the total noise increases to about 
twice the amount at the first repeater as we go along a very long 
chain. If, as is most likely, some of the repeaters in the chain are 
mistuned in the opposite direction or to a smaller degree, the noise 
at the end is smaller than that above. Thus it is seen that mistuning 
of the timing tank is not a factor in the accumulation of large amounts 
of timing noise in a long chain of regenerative repeaters. 

The situation is different, though, if we have a pulse generator 
whose trigger point is offset from a zero crossing. The timing noise in 
this case is a direct consequence of the amplitude variation of the 
timing wave, which variations have a spectrum with nonzero value 
at zero frequency. As shown in Fig. 3, this causes the rms value of 
very low frequency timing noise to increase linearly at successive re- 
peaters in a chain having equally offset triggers in each. This noise 
is designated type B. The total noise at the end of the chain increases 
without limit as the number of repeaters increases. The total amount 
varies inversely as the Q of the tank circuits. 

It was demonstrated that the theory applies also when both mis- 
tuning and amplitude-to-phase conversion are present simultaneously. 

Spectra of timing noise caused by pulse shape alone are shown for 
several particular shapes in Fig. 4. While the total noise for the wider 
pulses is fairly large here, because it is spread over a wide frequency 
band, the magnitude of the undesirable very low frequency com- 
ponents is quite small. For example, the total noise for the asym- 
metrical overlapping pulses is only one-fifth the amount per repeater 
measured in the T1 system. The results of this investigation in- 
dicate that some form of amplitude-to-phase conversion is probably 
the greatest source of very low-frequency timing noise. 

The idea that, for the propagation of phase deviations, the chain 
of regenerative repeaters resembles a chain of tandem low-pass re- 
sistance-capacitance (RC) filters follows from considering the phase 
deviation to be a modulation of a carrier wave at the pulse rate. Ex- 
periments verified this idea which had been suggested earlier.® 

A brief outline of the simple theory and method of calculation 
will now be given; a more detailed description will be made in Sec- 
tions 2.1—2.5. Consider the spectrum of the incoming pulse train, 


546 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 




















= 


O MEASURED 
Vv 





POINTS AT f=0 
CALCULATED ARE SHOWN FOR 
N=6,5,4,3 








N=NUMBER OF LIKE “REPEATERS” 





IN 0.625 HZ BAND 








DEGREES R,.M.S, 


























0 
0,625 1.25 2.5 6.25 {2.5 25 62.5 
FREQUENCY IN HZ 


Fig. 3—Spectra of timing noise caused by timing wave amplitude variations 
and trigger circuit offset from zero. Tank tuned to pulse rate. Random rectangular 
pulses, 10 percent duty factor. N = number of like repeaters in chain. 
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Fig. 4 — Timing noise spectra caused by various pulse shapes. 


which will contribute very little timing noise itself if the pulses are 
narrow.” This spectrum consists of discrete components at harmonics 
of the pulse rate and a broadband of a special kind of noise.t Next 
consider this broadband of noise to be divided into small evenly 
spaced bands, each small band replaced by a single frequency com- 
ponent having the same power as the small band. Taking these side- 
frequencies in pairs about the pulse rate, we may think of the spec- 
trum of the pulse train in the vicinity of the pulse rate as a carrier 
wave amplitude modulated by a number of small components. 

The tuned circuit by which the pulse rate fundamental is selected 
from this spectrum to provide a timing wave, also admits some of the 
noise side frequencies which are still symmetrical if the tuned cir- 
cuit is centered exactly on the pulse rate. When this tank circuit is 


*The meaning of this is brought out in Sections 4.3, 2.6.5, and 2.66. 


548 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


detuned from the pulse rate, the side-frequency pairs in the tank 
response are no longer symmetrical about the carrier. Sometimes the 
dissymmetry is described by saying that a quadrature carrier term 
has been introduced. The dissymmetry in amplitude, or phase, or 
both, is equivalent to phase modulation of the timing wave. This 
phase modulation of the timing wave is transferred to the outgoing 
pulse train in the regeneration process. 

Next, assume that these asymmetrical side frequencies (which are 
another description of the above phase modulation), after being at- 
tenuated and phase shifted in transmission through the narrowband 
tank of the second repeater, would add directly to the corresponding 
ones newly generated by the detuning of the second tank. Phase 
deviation calculated from this assumption agrees very closely with 
that measured, not only after two repeaters, but after many have 
been traversed. 

If the amplitude modulation at the tank output, corresponding to 
the symmetrical components, is not entirely removed by a limiter, 
that’ remaining may cause further phase modulation. For example, if 
a pulse generating circuit is supposed to trigger at a zero crossing of 
the timing wave but actually triggers a few degrees away from zero, 
amplitude variations of the timing wave will be converted to phase 
variations. The magnitude of these phase variations and their in- 
crease along a chain of repeaters may be successfully calculated us- 
ing the same methods described above. 

The simple theory applicable in a number of cases, and which leads 
easily to the spectrum of timing noise, is inadequate for pulse shapes 
other than narrow ones. Here the pulse train itself is now a source of 
timing noise. For example, if the pulses have a finite width, a small 
additional amount of noise (type B) can arise, although not in all 
cases. The limitations of the simple theory and how it fits into a more 
general theory is discussed in detail in Section 2.6. 

The more general theory, described in Section 2.6, is also developed 
from a frequency viewpoint and is based on analysis by 8. O. Rice to 
whom I am indebted for this work. With it the amount of timing 
noise in the situations of the previous paragraph were calculated. Also, 
this theory was used to calculate the timing noise for raised cosine 
pulses two time slots wide, hence with large enough overlapping so 
that a non-linear device is required in order to derive the pulse rate 
fundamental. It was found that in this case, the timing noise is Type 
A (that is, it does not build up in a long chain of repeaters) with a 
small qualification discussed in Section 2.6.6. 
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Another source of type B noise (which builds up indefinitely) is 
in any low-frequency distortion of the pulse spectrum if this is fol- 
lowed by a nonlinear operation of any kind. 


II. THEORY OF PHASE NOISE GENERATION AND ACCUMULATION 


As mentioned in the Section I, the simple theory is described first, 
with the more general one and its relation to the simple one being 
discussed in Section 2.6. The principal area in which the simple theory 
is satisfactory is that in which the pulses are narrow. In this case the 
pulse train itself causes very little timing noise, and so other sources 
may be considered separately. 


2.1 Spectrum of Narrow Pulse Train 

It is assumed that the message pulses are represented by a random 
train of narrow, rectangular, unipolar pulses. By random pulse train 
is meant one having regularly spaced pulse positions which are filled 
or not at random. Although most of the work was done for an average 
pulse density of one-half, the result would not be appreciably different 
except in magnitude, if this parameter differed somewhat from the 
value one-half. 

The spectrum of this train has been calculated by W. R. Bennett.* 
Part of the spectrum is a series of harmonics, the fundamental of 
which is the pulse rate, and the magnitudes of which are determined 
by the shape of the individual unit pulses. The spectrum of the other 
part has the same shape as the envelope of harmonics, and is con- 
tinuous and therefore is a noise. Bennett points out that while this 
noise is like thermal noise in some respects, that is, for example, in 
the proper frequency band the two sound alike; nevertheless it has 
a phase structure which thermal noise does not. 

If the original pulses are rectangular of height V, and duration + 
and occur at regular intervals 7 = 1/f, with a probability of 1/2, 
Bennett’s calculation shows that the mean square value of the funda- 
menal term at f = f, is 


A2/2 = (V2/2n°) sin? a7/T (1) 


and that the mean square value (in a band B Hz wide) of the noise 
part is 


W(f) = B(V?/2n?)(f./f?) sin? (arf). (2) 


From a somewhat different point of view, the spectrum of this 
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random train of narrow pulses may be calculated by first considering 
the train to consist of repeated blocks of random pulses, each block 
being N pulse periods T in length. The Fourier series representation 
of this train consists of harmonics of the pulse rate f, plus single 
frequency “noise” components spaced f,/N apart. The harmonics 
have nonzero average values. While the noise components have zero 
average values, their average powers are nonzero. If N is made to 
approach infinity, this spectrum approaches that calculated by Ben- 
nett. The representation by finite components means, in effect, that 
a band of noise f,/N Hz wide is replaced by a discrete term having 
the same mean square value. If S and A; are the rms amplitudes of 
one of the noise components and the pulse rate fundamental, respec- 
tively, then, as shown in (113), Section 2.6.5, 


2,42 _ (Bf.) sin* (xfr) 
oh aes f’ sin? (rrf,) ’ 8) 
where f-/N has been replaced by B. This agrees with (1) and (2) 
from Bennett’s calculation. In the vicinity of the fundamental, we 
have 


S/A, = (B/f.)4. 


The spectrum of this representation of the pulse train is shown in 
Fig. 5. 

Near the pulse rate component, the noise amplitudes on both sides 
of it are nearly equal because the pulses are narrow. For example, 
when 7/7’ = 0.1, values of S for components 2.5 percent above and 
below f, differ by about 0.2 percent. Hence an approximate representa- 
tion of this region of the spectrum is 


E, = A,{l + >> a, cos (2rkf,t/N)] cos 2rf.t (4) 


where 
a, = 28/A; & 2(B/f.)* (5) 


which describes the input as an amplitude modulated carrier. A more 
accurate representation would include a modulated quadrature term 
to account for the slight dissymmetry of side frequencies. 

It is the regular spacing of the random pulses which gives the 
phase structure to the noise spectrum, causes zeros of the wave EH; 
to appear at regular intervals 7 = 1/f, apart, and which makes pos- 
sible the representation in (4). 
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Fig. 5— Pulse pattern and spectrum. 


In the simple theory, attention is given only to that part of the 
pulse spectrum in the vicinity of the pulse rate fundamental as indi- 
cated by the representation (4). The strength of the noise terms with 
respect to the fundamental is obtained by the indicated statistical 
averaging of the pulse train components with the result (3). The 
reasons why these simplifications are satisfactory are discussed in 
Section 2.6. 

In the Section 2.2, the response of the tuned circuit to this re- 
stricted portion of the pulse spectrum, considering the noise terms 
to have fixed amplitudes, is calculated. 


2.2 Response of Tuned Circurt to Narrow Pulses 

In calculating the response of the tuned circuit to (4), we need 
to consider only one representative modulation term of frequency 
q/2r = kf./N, namely 


Ey, = A,{1 + 2¢S/A;) cos gt] cos w,t. (6) 


The response to the two side frequencies S cos (w, +q)é¢ of this one 
term is 
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Eos = Si COs (wet + gt + 6,) + Se cos (w,t —= gt +- 62) 
S; cos [wt + ¢) + gi + (4 — ¢)] 
+ S2 cos [w.¢ + ¢) — gt — (e — 4)], (7) 


where 6, and 6, and ¢ are the phase shifts received by the two side fre- 
quencies and the carrier respectively in going through the tuned circuit. 
If the tuned circuit is resonant at w, , the symmetry of the side fre- 
quencies about w, in both amplitude and phase which exists in (4) is 
preserved in the response. But if it is resonant at w, , different from w, , 
the response side frequencies are unsymmetrical as indicated in Fig. 6. 
In this case, they may be resolved into a pair with even symmetry 
and a pair with odd symmetry, or into a component in phase with the 
carrier and another in quadrature with the carrier. That is, we get 


E,, = A, cos [wt +¢+ gi + ¢.] + A, cos [wt + ¢ — gi — ¢,] 
+ A, cos [wt +o + qit+¢,.] — A. cos [wt + ¢ — gi — ¢,] 
2A, cos (qi + 9,) cos @w,¢ + ¢) 


I 


ll 


— 2A, sin (gi + ¢,) sin (@.¢ + ¢) (8) 
where 
2A, = {Si + S} + 28,82 cos [(v — 62) — (0, — ¢)]}}* 
2A, = {Si + S: — 28,8, cos [(e — %) — (6: — oI}! — (9) 
tan wh Si sin (6; =o ¢) + S2 sin ( = 6.) 
a S, COS (6; = ¢) + Se COS (y — 6.) 
a S2sin (gy — 62) — Sisin (6, — ¢) | 


~ S_ cos (pg — 62) — S; cos (6, — ¢) 


Thus the resonant tank response to the amplitude modulated wave 
(6) is 


E, = [A, + 2A, cos (gt + ¢,)] cos (w.t + ¢) 
— [2A, sin (gt + ¢,)] sin (wt +). (10) 
When S/A;, is small, as at present, (10) is approximately described by 
E, © All + 2(A,/A.) cos (gt + ¢,)] cos [(w.t + 9) 
- + 2(A./A.) sin (qt + ¢)]- (11) 


The tuned circuit to which the train of random pulses is applied 
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Fig. 6— Resonant circuit characteristics. 


in order that the pulse rate fundamental may be selected is described 


by 


Let 


Then 


I : 1 
Bom YG2nf) = an oan (12) 
B+ set — 5) | 
wLlC = 1 
w,L/R = Q 
wo = nw, ae Aw 
Aw/o, = x. (13) 
V(j2nf.) = = (14) 


R 
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and 


. er a2 
VGaef)/ VG.) = - om fae 


This general formula will be used in the treatment for wide pulses. 
Here we will be concerned only with frequencies in the neighborhood 
of f, and so we set n = 1. Thus 


Y,(j2rf)/Y(j2rf.) = 


(15) 


ite 
1+ 2 + j2Qx(1 + 2/2) 
When Aw/w, = x is small, a satisfactory approximation to the trans- 
mission Y,(j27f)R is 


(16) 


; 1 
Yi(2af)R ©; 10 Aaa (17) 
The phase of Y, is 6, where tan 6 = —2QAw/w,. Let 
Lo = 5/w,, Ly = Q/w., (18) 


Y= LX, + XQ, Ly = Lo — Ly, 
where § is the amount of tank circuit detuning from the carrier and 
q is the modulation frequency. It is convenient also to write 
S,/S = | Y[j2af.(1 + x + 2,)]|R = y,; = cos 4; 
So/S = | Y[j2rf.(1 + x. — x,)] | R = yo = cos 62. 


The amounts of amplitude and phase modulation in the tank 
circuit response (11) can now be given explicitly. Writing 


= (EL 
A, \A,/\AJ/\ S 7’ 
then substituting (5) for the first ratio on the right, and the upper 
of (9) for the third ratio, and noticing that 


A,/A; = | Y(j2rf.(1 + «.)]| R, 


(19) 











we have 
24,/A, = (B/f.)'(1 + tan’ ¢)} 
7 -{yi + y + 2yrye cos [(v — 62) — (6. — g)]}}# ~— (20) 
‘Similarly, 
2A,/A, = (S/A,)(A:/A,)(24./8) = (B/f.)*( + tan? 9)? 
| {yi + y2 — 2yrye cos [(y — 2) — (6 — @))}?. 2D) 
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When the amount of detuning is small (0.1 percent detuning with 
€ = 100 makes tan » = 0.2), the expression inside the large radicals 
of (20) and (21) may be simplified so that 





____ 2B/f.! 
se eae Ore 2) 


i (2qQ/w.)* ’ 


and also ¢,  @. 

The two expressions in (22) and (23) describe the amplitudes of 
the amplitude and phase modulation in (11) which are the responses 
of the resonant tank circuit to an amplitude modulated wave rep- 
resenting part of the incoming pulse train. Before discussing the 
meaning of these results for the generation of phase modulation or 
how it accumulates in a chain of repeaters, we will calculate the 
amount of phase deviation generated by another possible source 
within the repeater. 


2.3 Amplitude to Phase Conversion Factor of Offset Trigger 

In an ideal repeater, a perfect limiter following the resonant tank 
circuit would remove all the amplitude modulation from the derived 
timing wave (11) which would then, at one of its zero crossings, 
trigger a pulse generator as a part of the retiming process. But in 
a real repeater, the limiting would not be perfect so that some ampli- 
tude modulation remains on the timing wave; also the trigger point 
may have drifted away from the zero crossing. This is one way in 
which amplitude variations of the timing wave are converted to 
phase variations in a regenerative repeater. The diagram of Fig. 7 
illustrates the conversion. 

Referring to Fig. 7, where the triggering level has been offset By 
the bias b or by the anette yo which are related by 


siny, = b/A,, (24) 


it is seen that the change, Ay, in triggering angle for a change, AA, 
in amplitude is 


Ay = — tan y,(AA/A.,). (25) 


The amplitude variation in (11) at the tank circuit output, is 
reduced by a factor Ky, in the limiter which follows, so that the input 
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AMPLITUDE -——> 





TIME ——> 


Fig. 7 — Amplitude-to-phase conversion in trigger with offset: bias. 


for the pulse generator is 


Afi + K,(2A,/A.) cos (gt + ¢,)] cos [(w.t + ¢) 
+ (2A,/A.) sin (qt + ¢.)]. (26) 


Hence the phase variation introduced by the offset trigger, using 
K,2A,/A, from (22) for AA/Ao, is 


_ —2(B/f.)?K, tan y. 


ae ia aga eee eel 


(27) 
2.4 Application of These Results to Timing Noise 
‘The phase variations (referred to the carrier fundamental) of the 


pulse generator driven by (26) are then given by the sum of Ay from 
(27) and 2(A,/A.) sin (qt + ya) from (28), that is, 


o, = BBL) (80/w)(4Q/e.) 
1 + (2qQ/w.)° 
(1 +-(2qQ/w.)° 


From the magnitudes of the two components of (28), the spectra of 
timing noise caused by tank circuit mistuning and by amplitude to 


sin (gt + 9a) 


cos (qt + ¢.) (radians). (28) 


TIMING NOISE 5D7 


phase conversion are determined. These are plotted in Fig. 8 with 
Ky, tan yo = 1. The phases g, and ¢, are plotted in Fig. 9. The first 
term in (28) specifies the spectrum if mistuning alone is present; 
the second term applies if mistuning is zero and there is amplitude 
to phase conversion. It is seen that the spectra in these cases are 
quite different at very low frequencies near g = 0. The phase modula- 
tion for mistuning alone is zero at zero frequency and has a maximum 


max (2A,/A,) = 2(B/f.)*(6Q/w.) (29) 
at 


2qQ/w, = 1, (30) 


while that for amplitude to phase conversion alone has a maximum 
value of 2(B/f,) Kr, tan y. at zero frequency. This difference be- 
tween the two spectra at low frequencies has a very important effect 
on the accumulation of timing noise in chains of repeaters, as will 
be seen in Section 2.5. Second, we see that the first component depends 
directly on the amount of detuning, 8, and is zero for zero detuning. 
See Section 4.1. This emphasizes what Bennett has said about the 
difference between the noise spectra of random pulses and thermal 
noise.t In the latter case, phase noise would not be zero for zero 
detuning. 

The good agreement between phase deviations calculated in this 
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Fig. 8—Amplitude spectra of symmetrical (A;/A) and_antisymmetrical] 
(A./A) side frequencies caused by tank mistuning (calculated), 
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Fig. 9— Phase spectra of symmetrical (®,) and antisymmetrical (.) side 
frequencies caused by tank mistuning (calculated). 


way, and measured values, has already been mentioned in the intro- 
duction and is shown in the curves of Fig. 2 and 3. 

In one sense both of these effects are amplitude-to-phase conversion 
with the amplitude modulated carrier representation of part of the 
pulse train, as in (4), being the original amplitude variation. From 
this viewpoint, in both (23) and (27), 


2(B/f.)? 

is the applied amplitude variation; the factor 

[1 + (2qQ/w.)"? 
is the attenuation of the tuned circuit; while 

—K, tan y, 

is the amplitude-to-phase conversion factor in (27), and 

4(6Q/,)(qQ/e.) 

[1 + 2¢Q/w.)"]* 


is the corresponding conversion factor for mistuning from (23). 
When both tank mistuning and trigger offset are present in the 
timing wave path, their combined effect may be calculated by adding 
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the two as suggested in (28). The correctness of this has been verified 
by experiment as seen in Fig. 10 where both measured data and 
values calculated according to (28) are plotted. This result emphasizes 
again that both of these phase modulation effects have a common 
source in the special noise side frequencies about the pulse rate in 
the spectral representation of the random pulse train. 


2.5 Accumulation of Phase Modulation in a Chain of Repeaters 


Next consider how phase modulation accumulates in a chain of 
repeaters when the same amount and kind of phase modulation is 
generated at each repeater of the chain. 

Assume that phase modulation generated in the derived timing 
wave at repeater 1 is 


vg, = ®, sin (qt + ¢.) (31) 


and that this is passed along unchanged by the regenerator. Then 
at the input to repeater 2, this is equivalent to the presence of a pair 
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Fig. 10 — Spectrum of timing noise caused by tank detuning and trigger offset. 
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of antisymmetrical side frequencies 


S,/A = +(®,/2) cos (wt + ¢ + gi + ¢.) (32) 


about the carrier. The transmission of these side frequencies through 
the tuned circuit of repeater 2 is governed by expressions (11), (20), 
and (22), developed above for the transmission of amplitude modula- 
tion. In using these in the present circumstance, the amplitude #,/2 
in (82) takes the place of S/A; in (20). The response of the tank 
circuit of repeater 2 to these side frequencies is then approximately 


ae 2/2 xy cos (wf +o + git o + %) (33) 


[1 + (2¢Q/w,)"] 
if it is assumed that the pulse amplitude, and hence that of the 
carrier, are the same at repeater 2 as at repeater 1. This expression 
is independent of mistuning when the degree of mistuning is small, 
as we have assumed. In addition to this response we have the pair 
of antisymmetrical side frequencies, as expressed by (32), but now 
generated at repeater 2, namely 


-++ (#,/2) cos (wt +¢ 4 qt + ¢,). (84) 


Thus at the output of the tank circuit of repeater 2, the antisym- 
metrical side frequencies are represented by the sum of these two, 
that is (34) and (88) or, 


4: (B,/2) 08 wt +o gt ou) 


1 
SS SSO OP-ART 
[1 + (2qQ/w.)*}* 
Therefore the total modulation #. at the output of repeater 2 is 
seen to be 


cos(w,fé+tetgqteot ed}: 


b, = | d, + ®, (cos 6) exp (76) | (35) 
where cos 6 has been substituted for 
fo ee 
[1 + (2¢Q/.)"]? 
according to (19), and », has been replaced by its approximation 6. 


Carrying through the same process for repeater 3, we find the phase 
modulation at repeater 3 output to be 


d, = 4, | 1 + (cos 6) exp (j8) + (cos 6)” exp (j26) |. (36) 
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These show that, while the actual combining process at each repeater 
involves the direct addition of side frequencies, we can drop the carrier 
reference and consider the generation and propagation of modulation 
alone. And the ®, generated at each repeater need not be only the 
component (23), but may be the more general one (28) in which 
mistuning and amplitude to phase conversion effects are combined. 
The process begun in (36) may be generalized to give the amount 
of phase noise at the Nth repeater in a chain of N-like repeaters. 
This is 
~— : bens 
Ee | = 9] 
where Y,R has been written for (16). The approximation (17) to 
Y,R and the relation (19) are used so that 





» (87) 





by = 6, |(Y)y| = 


1 ; 
YR & ioe ro = (cos @) exp (j6), (88) 


where 2, = q/w, aS defined in (18). In (87), (Y)y is not an admittance 
as Y, is, but is a transfer function. By direct expansion of (Y)y , using 
(38), 


_ L+ (cos 6)"" = 2(cos 6)" cos NO, 


sin’ 6 (39) 


| (Yw P 
(Y)y may be considered a sort of phase deviation transfer function 
for a chain of N repeaters when ®, is the phase deviation generated 
in each repeater. |(Y)w| is plotted for N = 4, 30, 100 in Figs. 11, 
12, and 18. 
Since 6, and (Y)y are functions of the phase deviation frequency 
q, | ®y | will describe the spectrum of accumulated phase noise. Con- 
sider first the case of tank circuit mistuning alone. From (23), we have 


6, = K, Tee = 2K, sin 6 cos @ (radians) (40) 
where 
K, = 2(B/f.)(6Q/e,). (41) 


Then for mistuning, 
| dy |? = 4K? cos” 6[1 + (cos 6)” — 2 (cos 9)” cos NO]. (42) 


This is a rather unwieldy expression when J is large, but it may be 
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Fig. 11— Effective transfer characteristic of four tandem timing tank circuits 
to modulation on pulse train. 


approximated for the purpose of finding its maximum value by 


| dy | 4K, (cos 0)°*"”™ sin (N 6/2) (43) 


since 6 is small for large N at the maximum of | ®y |. The expression 
in (43) is maximum with respect to 6 when 


tan 6,, tan (N6,,/2) = N/(N + 2). (44) 


A list of maximum values for ®y are given in Table I: 
These are plotted in Fig. 14 along with four of the full calculated 
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Fig. 12— Effective transfer characteristic of thirty tandem timing tank circuits 
to modulation on pulse train. 
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Fig. 18 — Effective transfer characteristic of one hundred tandem timing tank 
circuits to modulation on pulse train. 


spectrum curves of Fig. 2. It is seen from (48) that 4K, is the largest 
value ®y can have. 

The spectrum of &, is plotted along with the (Y) y function in Figs. 
11, 12, and 13 to suggest why the peak values of ®y do not increase 
indefinitely with NV. 

To get the total mean square “power” of @y, | ®y |? in (42) is 
integrated with respect to x from x = 0 to o or, with respect to 6, 
from 6 to 0 —7/2. Since the expression for Sy in (42) gives the peak 
value, we have for the total mean square power Py, 


1 a nas 2 
Py=5[ lev(OF af, (45) 


where a value of 1 Hz is used for the bandwidth B in K,. Evaluation 


Taste I—Maximum VALUES FOR ®,, 





N Om Qam Dm 

1 a /4 0.5 Ky exact 

4 0.49 0.267 2.384 Ky exact 

4 0.463 0.249 2.290 Ky approximate 
10 0.2525 0.1290 3.14 Ky, approximate 
30 0.0997 0.049 3.68 Ky approximate 
50 0.0603 0.0892 3.80 Ky approximate 


100 0.0308 0.0154 3.90 Ky approximate 
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_POSITION OF CALCULATED VALUES 
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Fig. 14 Calculated spectra of timing noise caused by tank circuit detuning. 


of this integral gives 


ie 1 1 (2N)! dena 
Py = 2rQ o {1 = 5va Ben Bu (radians) (46) 
ignoring the small difference between w, and w, . For N = 1, we have 
and for N 2 4, Py is approximated very closely by 
o 1 1 
Py ~ 2nQ Ps 1 = gN-1 + inp (48) 


Twice the quantity in the braces is plotted in Fig. 15, where exact 
values from (46) are used through N = 4. 
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The expression in (47) for P,; was derived in a different way by 
W. R. Bennett.t The expression (46) for Py has been derived inde- 
pendently and differently by 8. O. Rice. 

When there is no tank circuit mistuning and amplitude to phase 
conversion is the only source of phase deviation, that generated at 
each repeater is given by (27), that is, 





a 2(B/f.)’K x, tan Yo 
[1 + (2¢Q/e.)"]* 
= 2K, cos @ (radians). (49) 
In this case 
| by |? = 4K> a (cos)” [(cos 6)* + (cos 6)°” — 2 cos N6}. (50) 


When (cos 6)” > 0.8, that is for 6, and hence q, sufficiently small, the 
sum of the first two terms within the bracket is very close to 2. For 
this condition, by may be approximated by 


~~ a+y2) Sin (N 6/2) 
| by | = 4K,(cos 6) ear see 


2K ,N[cos (N6/2)|(cos 0)°**”™, (51) 


which shows that the phase deviation very near zero frequency 
increases directly with the number N of repeaters in the chain. This 
is in contrast with the similar result (43) for mistuning where the 
largest value of | Sy | is only four times that of | 4 |. (Y)y is the same 
for both. The difference arises because of the difference in spectrum 
shape of the generated phase deviation in the two cases. In mistun- 
ing, |,| is zero at zero frequency, while in amplitude to phase 
conversion effects, |®,| is flat and nonzero for very low frequencies. 
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Fig. 15— Calculated total timing noise caused by tank circuit mistuning as a 
function of the number JW of like repeaters in a chain. 
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To find the total mean square “power” of ®y, the integral (45) 
is calculated, using (50) this time instead of (42). The value of this 
integral is given in a paper by Byrne, Karafin, and Robinson.® It is 


1 (@N-)D! (52) 


_ the { ah 

Py 0 KAN 2 T-Oy — HIF 

which shows that the total phase noise power increases directly with 

the number WN of repeaters in a chain and varies inversely with the Q 
of the tank circuits. 

In Ref. 8, the result (52) and the spectrum (50), or something 
closely related to it, were derived by assuming that at each repeater 
there was a noise source (nature and magnitude unknown) with a flat 
spectrum and ‘power’ density 2K? . The process followed is similar 
to work first done by R. C. Chapman of Bell Telephone Laboratories. 
Also in the paper of Ref. 8, the results of measurements of accumulated 
timing noise in an experimental 7'l system are given and the spectra 
of these are like those of Fig. 3. 


2.6 Spectrum and Phase Noise of Wide Pulses 

In most practical systems it is not desirable for the transmitted 
pulses arriving at the timing circuit input to have the narrow shape 
considered in the previous section of the paper. It is to be expected 
from the above discussions of the effects of dissymmetry between 
upper and lower side frequencies, that wider pulses having spectra 
with these characteristics would introduce phase noise in the derived 
timing wave even when there is no mistuning or amplitude to phase 
conversion. But the calculation of dissymmetry needs to be more 
elaborate for the wider pulses; the simple calculation used above is 
inadequate. 

A more generally applicable theory is available from analysis 
by S. O. Rice who worked it out originally for pulses wide enough 
to spread over two time slots. How the simple theory is related to 
the more general one to be described in Section 2.61 will be discussed 
in Section 2.66. An outline of Rices’ analysis follows. It is assumed 
that neither mistuning nor amplitude to phase conversion is present 
so that the phase noise calculated is caused by the pulse train alone. 


2.6.1 Fourier Series for Pulse Train 


The general theory is based on a frequency analysis also and so 
the pulse spectrum is calculated first. We consider the train to consist 
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of repeated blocks of N pulse periods each, with each of the N pulse 
periods having a pulse or not at random. The train has a period NT 
and may be described by the Fourier series 


eo 


I(t) = >> C, exp (j2rmt/NT). (53) 
If the parameter N is made to approach infinity, a random pulse 
train will be obtained. But a good approximation results if N is just 
large, say 100. 

In order to consider pulses which may be as much as two pulse 
periods wide so that there is considerable overlapping of adjacent 
pulses, four auxiliary functions are necessary to specify the current 
I(t) in any one pulse period of duration 7. These functions correspond 
to the four possibilities of (2) no pulse present, (iz) only the leading 
edge of a pulse present, (v1) only the trailing edge of a pulse present, 
and (tv) overlapping pulses present. 

This is illustrated in Fig. 16 which is a short section in time of a 
random train of raised cosine pulses exactly two pulse periods wide. 
The four possibilities mentioned above occur in that order in periods 
3, 1, 2,5 of Fig. 16. For a random train of pulses all these possibilities 
eventually occur in any one time slot as suggested in the composite 
drawing of Fig. 17a. In this a number of sections in time of Fig. 16 
have been overlapped as they would be in an oscilloscope presenta- 
tion. In Fig. 17a, the four possibilities in the order given above are 
AC, AD, BC, BD. Another illustration is the same wave after passing 
through a half-wave rectifier which begins conduction at the half 
amplitude level as shown in Fig. 17b. Figures 18a and b are photo- 
graphs of oscilloscope patterns of real pulse trains which approximate 
the idealized ones of Figs. 17a and b. 

Then I(t) may be represented by a sequence of functions [,(t), 
each of which is specified by the auxiliary functions F(t’), F(t’), 
F,(t’) and the parameter a, as follows 


a, = 1, G1 =1, I,(t) = Fv’) 
ae a,. = 0, I,() = F(t’) 
0 a-1=1, I, = F3(¢’) 

0 


5 La) = .0; 


(54) 
a, = 0, an,-) = 


in the time slot interval 


(n-—1)T —»~T StSnT — 1T 
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Fig. 16—Short, time section of a random train of raised cosine pulses, each 
two time slots wide. 


and is zero outside this interval. In this representation, a, = 1 if a 
pulse begins in the interval under consideration and a, = 0 if a pulse 
does not begin in it. The time scale ¢’ used for describing the pulse 
train is related to the time scale ¢ of the Fourier series by 


t=t—(n—-—1)T +T. (55) 


The time shift (n — 1)T brings the pulse forms in the nth time slot 
back to the first one for description by the auxiliary F functions, The 





ONE TIME SLOT 


(b) 


{ 
F(t") F, (t’) 
t! 


A ’ 
(c) 0 T 0 T 


F(t!) / i" ; 


Fig. 17—Idealized pulse waveforms: (a) Overlapping raised cosine; (b) 
wave in (a) applied to half-wave rectifier; (c) functions used in calculation. 
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Fig. 18 — Photographs of random pulse oscilloscope traces: (a) Raised cosine 
pulses, two time slots wide at the base; (b) bottom half of (a) obtained with 
half-wave rectifier; (c) raised cosine pulses, one time slot wide at the base. 
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shift v7’ will be convenient later when it will be desirable to have 
the occurrence of pulses adjustable with respect to the zeros of the 
sinusoids in the Fourier series (53). To make J(t) a random pulse 
train, we let a,, dg,...,@,... be independent random variables with 
probability of a, = 1 being p and probability of a, = 0 being g = 
1 — p. 

The representation just described was designed with the situation 
of Fig. 17 in mind; but it is valid for other pulse shapes including 
those which occupy just one time slot or less. The transformation of 
time scales is illustrated for rectangular pulses in Fig. 19. 

The specifications (54) may be expressed by 


r™ 
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Fig. 19 — Representation of pulse train for Fourier analysis. 
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I(t) = a, — Gar) Poll’) + dar(l — a) F(t’) + and. F a(t’) 
_ a,l?,(t’) + Oni l3(t’) + AnOn—1[F'4(t’) =. Fi) — F,(t’)] (56) 


and I(t) is the sum of N separate I,,(¢). 


2.6.2 Fourier Coefficients 
The Fourier coefficients C’,, in (53) are then 


] NT-vT 
Cy = oa i exp (—j2amt/NT)I(1) dl 
—-vT 


x T 
= F x exp [—72rm(n —Ii- v)/N] / exp (—j2rmt’/NT) 


‘Lia —1—v)T + i’ di’ (57) 


where the integral from —y7 to NT — vT has been written in the 
second form as a sum of N integrals, each over an interval J and then 
the transformation t = (n — 1 — v)T + ¥ applied to each integral. Let 


T 
Bn = t= [ exp (—j2rmt'/NT)F.(t’) dt’ (58) 
0 
T 
im = / exp (—j2mml! /NT)F,() at! (59) 
0 
T 
Sn = rd , (exp (—j2rmt’/NT)[F.(t') — F(t’) — F3(t’)] dt (60) 
2 = exp (—72rm/N), Tm = exp (72avm/N). (61) 
Then 
1 N 
Cy = N a 2" (AnBm + An-1¥m + AnAn—1 5m)» (62) 
and the average value of C', is 
ae 2 
(Cudov = FF dX 2""(PBm + PYm +p” bn) (63) 
Since 
N 
2" =0 m # IN 
n=l 
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where J is an integer, we have 
(Caen = P(Bm Vin +p On) m= 0, tN; +2N, NE Bg 
(Cav = 0, otherwise. (64) 


The Fourier components for m = +N are those at the fundamental 
frequency of the pulse rate, 1/7. The average values of the “noise” 
components are all zero but the second order averages are not zero. 
Rices’ calculation of these follows. 

From the expressions for C,, and (C\n)ay In (62) and (63), it follows 
that 


(Cm > Cwpin Ci ey (C1)av) 


= N™ a os ee, a P)Bm is (Gn-1 = DYVm ae (BnQn—1 a D’) bm| 


n=1 k=l 


-[(, — p)Bi + (Qi-1 — D)¥i + (Q-1 — Dp) 5;] (65) 
where £ = exp(—j2l/N). Expanding the last product and using the 
independence of the a,’s (except for a, = ay) shows that the ensemble 
average of (65) depends upon the averages 

(a, rs D) x = p = Pg 
(a, — p)(ana; — p))w =p —p=pq, jn 
((QxQn—1 — DB)” ay = Pp — Dp 
(an; — ')(G.0; — D’))w = p’ — p= PG, 
JAN, Ln, tA i. (66) 
For n fixed, the only values of k which lead to nonzero averages 
are k = n and k = n + 1 with the understanding that for n = 1 the 
values k = 0, 1, 2 mean k = N, 1, 2, and for n = WN the values 
k=N—1,N,N+1meank = N —1,N,1. When k = n the average 
value of the summand in (65) is 
28" [pgBmBr + Dam: + (p° — P*) bm 51 
+ p°q(Bm 61 + Ym 82 + SmBr + Suy)]. (67) 
Fork =n +1 itis 
Ze" pg B m1 + DP Q(Bndr ++ Ont) fe P'q5m51]; (68) 
and fork =n—1, 


Se nay mbs + D’q(Ymt + SmB1) + p°qbndi)- (69) 
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Some experimentation shows that the sum of (67), (68), and (69) 
can be written as 


(26) [pg { Bay + my + Din(y + y')} 
{By +eyy + pity +y)} + p'@snd:], (70) 


where 
y = ¢ = exp (—jrl/N). (71) 
The sum of (zf)"", taken from n = 1 to N, is 0 unless 2g = 1, 
that is, unless MW += 0, +N, +2N,.... In this case 


N 


N? De) = NT, f= 2? = exp (4j2rm/N), 


n=1 


y = exp (—jrl/N) = exp [—jr(l + m)/N] exp (+jrm/N) 
(-1)°°"”" exp (jnm/N), (72) 


I] 


and (70) can be written as 


[pq(—-1)°*™ "S,8i + DG Sm51] (73) 


where 


Sn = Bum exp (jrm/N) + ym exp (—jrm/N) + 2p bn gO ae , (74) 


and S, is defined similarly with lJ in place of m, and 8, is the function 
defined in (60). 

Collecting results shows that averaging (65) over the ensemble 
gives 


(Cm — (Crm)ex)(Cr — (Car) 
es +7¢ bn &)/N, 
= m+l=0, +N, +2N,--: (75) 
lo. otherwise 


Replacing C; — (Ci) 5 $, 8:5 71, 6:, y by their complex conjugates 
in expressions (65) to (71), and noting that the sum of (z¢*)""" is zero 
unless 2¢* = 1, that is, unless m — 1 = 0, +N, +2N, --- , carries 
(75) into 
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(Cn — (Cmav)(Cr — (Cr)av)*) 
| [pq(—1)"""?'’"'S,,S# + p°q? bn St/N, 

= m—l=0, +N, +2N,-:-. (76) 
le otherwise 

In terms of the functions F(t) we have 


T 
5, = a exp (—j2amt/NT)[F, — F; — F.] dt 
0 


Sn = 7 [ exp (—demt/NI), [2pF. + (q — pfs + (gq — ps) 


- COS Me + j[F, — Fs] sin nin} dt. (77) 
The mean square value of the noise component of frequency m/NT is 
Ga = AC0) 5 (78) 


considering a positive frequency only spectrum. Thus from (76), we get 
o2, = ApglSnSt + pq bn 84] +: (79) 


2.6.3 Phase Modulation of Tuned Circuit Response 

To calculate the phase modulation on the recovered fundamental, 
we first obtain the response of the tank circuit to the train of rectified 
pulses, I(¢). This response is the approximate sine wave at the pulse 
rate frequency. 

The response I,(t) is described by 


I(t) = B,(t) cos [2rf.t + o(d)] (80) 


where the envelope R,(t) and the phase angle 9(¢) are slowly fluctuat- 
ing functions whose rate of change is proportional to the bandwidth 
of the tank circuit. Considering now, only those components of I(t) 
in the vicinity of the pulse rate 1/7’, we have 


I,(i) = 2Re >> Y,Cn exp (j2rmt/NT). (81) 
mN 


Writing Cm = (Cm — (Cm)av) + (Cm)ay and noting that (Cy)ay is the 
only nonzero value of (C»)ay in the summation, (81) may be rear- 
ranged* to be 


* Vy as used here is Ym for m = N and is not the (Y)y of Section 2.5. 
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I(t) = 2Re{ Yx(Cw)av exp (72rt/T) 
+ ya Y wlCn ~ (Cx)5) exp (j2rmt/T)} 


= 2ReYx(Cw)av exp (j2rt/T) 


VAC n — (Cm)av) Xp [j2r(m — N)t/NT] 
{1 ss x Yy(Cy)av \ 62) 


The summation part of this is small, partly because om/oy 1s small 
and partly because of the attenuation of the tank circuit Y,, except 
for frequencies near the pulse rate 1/7. Thus if the part within the 
brackets is represented by 1 + A + jB,|A + 3B | is small compared 
with unity and so 
1+A+ jB exp [A + jB (83) 
and 
T(t) & 2ReYy(Cw)aw exp [j(2rt/T) + A + 7B) 
~ 2 | Vy(Cy)av exp (A) | cos [2rt/T) + B + arg (Yx(Cy)av)]. (84) 


This is the approximate sine wave at the pulse rate frequency which 
has been recovered from the pulse train by means of the narrowband 
tank circuit. Its phase modulation is 


g(t) = arg (Yy(Cy)av) + B 
arg (Yy(Cy)av) + Im pS b,, exp [j2r(m — N)t/NT] (85) 


li 


where 


Bm = Yn(Cn — (Cm)av)/(Y w(Cn)av) + (86) 

The Fourier component of ¢(t¢) at frequency k/NT is determined 

by the two side frequencies mf,/N = (N + k)/NT and mf,/N = 

(N — k)/NT about the pulse rate as was found in the earlier analysis. 

To calculate this, we use the two terms for which m = N +k in the 
above sum. That is 


o, = Im [baz exp (j2akt/NT) + by_. exp (—j2akt/NT)] 
Im [(bv+e — b¥-x) exp (72akt/NT)]. (87) 


The time average of the “phase power” in this component is 


(Li)avt = 3 Dan i bi-. |? radians” (88) 
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which can be written 


(o2)ave = 3[D,D% — b,b. — D*DE + D*D_] (89) 


where the subscripts + and — denote N +k. 

The ensemble average of the expression (89) may be computed 
with the help of the second order moments of the C,,’s given by (75) 
and (76). For example 


(b..b*) ay = an YF (Cis a (Cyenyav) (Curse =, (Cy-snyav)*)/| ¥yCw asl 
= Y,Yi(pqS.8t + pg’ 54 5%)/| Yu(Cw)ev PN 
= [pqU.U? + p’q’V.V%)/N (90) 


where 


U, = (Y./Yy) (S./(Cy)av) Vie= (Y./Yy) (64/(Cy)aw) (91) 


and the subscripts + and — are used to indicate m, = N + k. Also 
m—Il= (N+ k) — (N+ k) = 0 so that the (—1) to the power 
(m — l)/N appearing in (26) is +1. Similarly, in the calculation of 
(b,b_), m +1 = (N +k) + (N — k) = 2N and the (—1) to the power 
(m + 1)/N appearing in (75) is again +1, and so on. 

After the ensemble average of (90) has been calculated, taking each of 
the four parts of (90) in turn, the terms may be combined to give one of 
the results we have been seeking: 


Cav = 5g {|U, — UX? + p¢|V.— V% |} radians’. (92) 


Since the Fourier components are f,/N apart, this expression for ¢; is 
equivalent to the ‘phase power” in a band f,/N wide. The averaging has 
been done over both time and the ensemble. 


2.6.4. Effect of Sampling and of High Frequencies in the Response 
Next we consider a generalization of the expression (92) as derived 
above for the phase modulation on the fundamental pulse rate ob- 
tained by passing the random pulse train through a tuned circuit. 
In deriving the result (92) for the phase modulation of the timing 
wave obtained from the tuned circuit response, only that part of the 
pulse train spectrum in the vicinity of the pulse rate was considered. 
While the major part of the tuned circuit response lies in the fre- 
quency region near the pulse rate 1/7’, as assumed in the calculation 
beginning with (81) and ending with (92), the way in which this 
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response is used in a regenerative repeater (or is measured in a phase 
detector) gives some importance to the part of the response neglected 
in (81). 

In a regenerative repeater, retiming of the message pulses is done 
by very sharp pulses which are generated at each positive (or nega- 
tive) going zero crossing of the timing wave. In the phase detector 
used in the experiments described here, the deviations of the zero 
crossings from their ideal periodic nature are measured and used as 
a sequence of numbers or held and smoothed by low pass filter to 
approximate the phase deviation function y(t). Thus in both these 
processes, it is the samples of the derived timing wave which are 
used. When the deviations are not too large, the magnitudes of the 
zero crossing deviations are equivalent to the magnitudes of samples 
of the phase deviation wave ¢(t), taken at the pulse rate. Since the 
high frequency part of the tuned circuit response which was neglected 
in (81) lies above one-half the sampling rate, the process described 
above as equivalent to sampling, may convert some of this high 
frequency part into very low frequency energy in the final timing 
noise result. In particular, if the high frequency part of the pulse 
spectrum has energy at or very near the harmonics of the pulse rate, 
this will be converted to zero or very low frequency energy in the 
phase noise spectrum. The analysis by S. O. Rice deals with this 
situation also. 

First the expression (81) for J,(¢) must be enlarged to include a 
de term and all values of m from 1 to o. This may be rewritten in 
the following form which corresponds to (88). We have 


I(t) = 2 Re YyCy exp 2ai/TI + $d, exp (—j2zt/T) 
+ >) d; exp [j2r(l — 1)t/T] + 3b, exp (—j2rt/T) 
l=2 


+ >° b,, exp [j2r(m — mynnys (93) 
m=) 
where | is an integer, 


d, os Yui(Cyt)av/ Yy(Cy)av 


and the coefficient b,, is defined as in (86). This expression is the 
complete response of the tuned circuit to the random train of pulses. 
Again assuming that the term within the square brackets in (93) 
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always remains near 1, 


g(t) & arg Yy(Cw)av 


+ Im iid, exp (—j2rt/T) + 3 d, exp [42r(l — nym} 


+ Im (30, exp (—j2nt/T) + >> d,, exp [j2r(m — mywry. (94) 


This is an improved version of (85). The first line on the right is 
approximately the ensemble average (o(t))ay. It consists of a dc 
term and harmonics of the pulse repetition frequency 1/7. While it 
may appear that the assumption about the bracketed part of (94) 
being small is unjustified because this represents the adding of har- 
monies to yield the pulse wave form, this is not the case since all the 
harmonics included are reduced at least by Q. Furthermore, since 
the sampling operation occurs near the zeros of the response, where 
the harmonics are zero or very small, an additional reduction of 
magnitude is involved. The noise portion of the power spectrum of 
y(t) arises from the second line, which may be written as 


g(t) — (e(t))av 


n © 


=m 40, exp (—j2rt/T) + >> bya, exp Genkt/TN) |. (95) 
k=1-N 
The sampling operation mentioned above, which is performed on 
the phase function y(t) in both the phase detector used in the experi- 
ments reported and in regenerative PCM repeaters, generates a new 
phase function @(t¢). This is 


A(t) & o(DT pa 6(t — nT) = o() 0) exp (j2rnt/T) (96) 


where 4(¢) denotes the unit impulse function. For some frequencies in 
g(t), the extraneous modulation products introduced by the impulses 
may be negligible. However, for the higher frequencies and for a single 
tuned circuit with slowly decreasing Y(iw), some of the products may 
become appreciable and should be taken into account. The sampling 
times in (96) were arbitrarily set at t = 0, 7, 27 --- for convenience in 
the analysis to follow. These can be varied to occur at or near the zeros 
of the response wave J,(t) by shifting the time scale of the description 
of the pulse train using the parameter » as indicated by (54) and related 
equations as illustrated in Fig. 19. Forming the ensemble average 
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(a(é))., , subtracting it from 6(é), and noticing that the sums in the 
expression (96) for @(£) are real, leads to 


o 


A(t) — (8(f))xv = Im 2. Do exp (j2ant/T) 


=-0 


zo y as bys, exp [y2r(k + anyi/w}. (97) 


n=—0 kel— 

The component of 6(t) of frequency 1/NT for 1 $1 < N — 1, that is, 
for frequencies which lie between 0 and f, , is the sum of terms having 
exponential factors exp (-4j2alt/NT). For k + nN = +1, the values 
of k and n are 

k= —-N+1], k 
n=l, n= 0, n= —-l, m= —2,°°-, 


and for k + nN = —l they are 


ON +1, ++, 


I 
Pa 
> 

I 


N+, &k 


k= —l, k=N-—1, k =2N —l,---, 
n= 0, n= -—l, N= —2,-°°, 


Therefore the component of 6(¢) of frequency 1/NT is 


Im [(b: + bysi + bevar + +++) exp (72alt/NT) 
+ (by-1 + Bey-1 + Oaw-1 + +++) exp (—J2al/NT] 
= Im [{(b: + bar + bewsr + +++) — (bw-1 + bew-1 + ++ +)*} 
-exp (j2rlt/NT). (98) 


When by+; and by_; are the dominant terms in (98), comparison 
with expression (87) (with k = 1) shows that the component of 6(¢) 
of frequency //NT is nearly equal to the corresponding component in 
y(t). By means of the procedure used earlier in (91) the average 
power in the component of 6(t) of frequency k/NT is obtained where 
we have returned to k from the J in (98). First, the time average of 
the power in the component is 


2 





1 wo foe) 
(Cy ier FS | a Dan+k = » biy—x 
2 n=0 n=] 


1 ive} ioe) foe) ie] 
a 9 b> bavek — se ut. |] Ore es pa it 
n=0 n=1 n=0 n=1 


where 0 < k < n. To average (99) over the ensemble we make use 


(99) 
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of (75) and (76) slightly rewritten by substituting exp(—jrm/N) for 
(—1)”/* and including it with S,,. Thus 


(Cm ~~ (Cma(C, = (Cr)ax)) 

= FE [Sm exp (—jrm/N)ILS: exp (—frl/N)] + PSF bq 4: , 
m+1l=0,+N,+2N,---, (100) 

(Ci, @ (Cmav) (C1 3 (Cie) as 

= a [S,, exp (—jrm/N)][S; exp (—jrl/N)]* + era 5, oF 


m—-1l=0,+N,+2N,::--. (101) 


The averages are equal to zero unless m and 1 satisfy the respective 
conditions. 
A typical term encountered in the averaging of (99) is 


(DansnO* nekyav i (bind) av 
wherem=nN +k,l=n’N +k, and m—J1= (n —7’)N. From (101) 
2,2 
(bub Doe = 5F [Um exp (—jrm/N)](Ur exp (—jnl/N)* + BF VaVE, 


m—-l=(n—n)N. 


In this U and V are generalized from the definitions of (91), so 
that for example 





_ Savek aN+k . 
Uaner ~< Cie Yy (102) 


Considering all four forms of (b,,b)sy as before, the ensemble aver- 
age of (99) is found to be 





(Gav = Say | 2 Unnee exp [— ina + 1)/N] 


Spe. ep lea ai 


n=] 


2.2 
po 
v9 





2 


(radians)’. (103) 








eo eo 
DS Vanek — se Viw-n 
n=0 n=1 


The exp (—jak/N) can be factored from the first absolute value and 
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then exp (+jrnN/N) replaced by (—1)”, so that 
(Onav = aa > (—)"U wee may > (—)"USy-2 


4 Be © p> Vivek = VS 
n=l 


which is the mee ae of (92) when the complete response of the 
tuned circuit is used. To repeat what was said before, since the Fourier 
components in the spectrum of @ are f,/N apart, this expression for 
(0;)av at f = k/NT is equivalent to the “phase power” in a band f,/N 
wide in a continuous spectrum. 

To obtain the continuous power spectrum w,(f) of the “noise’’ part 
of 6, we let m/NT = f' and N — o in the above expressions. First 
(77) becomes 


2 








(radians)’, (104) 








ap’) = SPLOtM) [exp (—j2nf' OUR, — Fy — Pa) at 


Sf) = SPNE@E LT! eel LH) [ . exp (—j2rf’t) {[2pF, + (q — p)Fs 


+ (q — p)F.] cos rf’'T + j[F, — F3\sinrf'T} dt. (105) 
Also , 
Uaner U(f’) 


U(nj. + f) 


_ Yi2enf +P) Sef +P 

Y(j2rf.) pLS(f.) + p d(f.)] 
where (Cy)ay has been replaced by p(Sy + péy) as derived from (64) 
and (74). 

Since the expressions derived earlier for the average power in a 
component of y(t) refer to positive frequency only, we shall deal 
with the one-sided power spectrum wg(f) of 6(¢). As before f = k/NT 
denotes the frequency associated with the average power expressed 
in (104). Then the value of the right side of (104) tends to w(f)Af = 
we(f)/NT as N > o and consequently 


(106) 





ao) ee BE 2 (YUM. +) — Dvr. - | 
a Epa. Y (radians)” per Hz, 
(107) 








d Vf. ~ 1 


where 0 < f < f.. 
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2.6.5 Application of Results in Section 2.6.4 to Particular Pulse Shapes 

Comparing the result (104) with the earlier (92), it is seen that 
each component of frequency f = k/NT in the spectrum of @ is made 
up of contributions from the pairs of components spaced f from all 
the harmonics of the pulse rate f,. The sampling of $(¢) has brought 
in all these additional contributions. 

Now we are in a position to find out what the effect is of neglecting 
the higher frequencies as was done in the previous work and espe- 
cially in deriving the result (92) to which (104) reduces when only 
n = 1 is considered. 

To do this, we will investigate several particular pulse shapes. As 
will be seen below, certain simplifications arise for n = 1 so that in 
some cases, approximate expressions for the noise may be derived. 
But, in general, and especially when the sums of (107) are to be cal- 
culated, the expressions become so complex that they cannot be dealt 
with readily in an analytical way. However, numerical calculations 
of we(f) in (107) answers most of our questions. Some of the com- 
putations for the rectangular pulses were first done by S. O. Rice. The 
others are extensions of them. The sums were carried to 30 terms 
for the rectangular pulses and to 15 or 20 in the other cases. Leveling 
off occurred before these cutoff points were reached. 

If each pulse is confined to one time slot, then J,(t) is determined 
entirely by a,. Thus, in the specifications (54), we have 


F;(t’) = 0; F(t’) = F,(t’). (108) 
As a consequence, it is seen from (59), (60), and (74) that 
bn = 0, Ym = 0 (109) 
Sn = Bn exp (jrm/N) (110) 
with corresponding simplifications in (75), (76), (79), and (92) 
and (104). 
2.6.5.1 Narrow Rectangular Pulse. First take the rectangular pulse of 
duration 7 used in Fig. 19. Here 
PY) =F) =1, CF—2)/2<l<T+7/2 


= 0, elsewhere. 


(111) 


Calculation of the pulse spectrum function S,, from (110), (58), and 
(61) yields 
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Sei, of Set Gaia ayy OP ee Deine ae E 


N (n + 2) sin r7/T 
(112) 


usingm=nN+kandx=k/N. 

The expression (3) in Section 2.1 for the ratio of noise power in a 
band B to carrier power is derived from (112) using (79) and (64), 
putting f’/f, for n + x, and noting that cy = 2pSy for a positive fre- 
quency only spectrum as in (79). Thus 


om _ _g SuSe 
ox DN Sy 
spf > 8m aha. 
ON (f')? sin? 7/7 
These components are f,/N apart and so the power of each corresponds 
to that in a band B of this width. The ratio «2/0, corresponds exactly 
to S’/A? in (3). 
To apply these results to the single tuned tank circuit centered on the 


pulse rate, the general form of Y,,,/Yy as given in (15) is necessary. 
This is 


(113) 


n+ 2 


Yoel Yn = 4 FoF ile + oF — 1 om 
Combining (15) with (112) in (102) or (106) gives 
U(nj. + f) 
_ (—)""* exp [jrx(1 — 2v)] exp [—jr2v(n — 1)] sin x(n + 2)7/T 
pin + « + jQ[(n + x)’ — 1] sin r7/T 
(114) 


with jf’ = nf, + fand m/N = f’/f.. 

First, notice that the factor exp (j2rzv) in (114) disappears in the 
calculation of (107) since exp (jrz2v) = exp * (jr(—2x)2v) and hence 
factors out of both sums. 

Next consider the unique conditions that arise when n = 1. The 
second exponential factor containing the parameter v, which determines 
the sampling time, disappears from (114). And the factor multiplying 
jQ in the denominator becomes proportional to x. The first means that 
the sampling time has no effect on the contribution of the n = 1 terms 
to wo(f). The second means that (U, — U*) > 0 as x — 0 and hence 
that the contribution of n = 1 to we(f) approaches zero as f — 0. 
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As indicated in Fig. 19, the zero crossing of the fundamental com- 
ponent comes at the ¢ = 0 sampling time for vy = 4; hence this value was 
used in the computations. Timing noise spectra obtained in this way 
for several durations of rectangular pulse are plotted in Fig. 20. In 
order that the effect of the high frequencies may be easily seen, the 
corresponding spectra considering only the n = 1 term are shown in 
Fig. 21. 

Several interesting points are evident: 


(2) Comparing the two sets of spectra, it is seen that the high fre- 
quencies have brought in noise at and near zero frequency, except for 
the 7/7’ = 0.6 duration.* 

(it) There is a great difference in the full spectra for 7/T > 0.5. 
This was observed experimentally for other values of 7/7 than 0.6. 

(277) The high frequencies in the timing tank response, through the 
sampling process, have not only brought in very low frequency phase 
noise, but reduced the higher frequency noise except for 7/7 = 0.6. It 
is the phase structure of this noise which makes cancellation as well as 
addition possible. 

(tv) In the case of 7/T = 0.6, there is only a small difference between 
the spectrum obtained with the full spectrum and that from considering 
only n = 1. 

(v) There is very little difference between the spectra for r/T = 0.1 
and 7/7’ = 0.02. This suggests that the phase noise may not disappear 
for pulses which approach spikes in shape. 

(vt) The magnitude of the noise, when compared with that of the 
usually more practical rounded shapes, is quite small, as seen in Fig. 4. 
It will be seen below, that the effect of the high frequencies is much less 
for these other pulse shapes. 


For the narrow pulse case, 7/7’ = 0.1, the timing noise spectrum 
can be changed greatly by small amounts of tank circuit mistuning, as 
shown in Fig. 22. This also can be a cancellation or an addition effect. 

When the tank circuit is mistuned from the pulse rate, -: x is replaced 
by x, + xin (15) for the normalized admittance. Here z, is the relative 
mistuning as defined earlier in (18). 


2.6.5.2 1T Raised Cosine Pulse. A photograph of an oscilloscope dis- 
play of this pulse shape is given in Fig. 18c. Each pulse is confined to 


* The possibility of this property of the noise was pointed out by H. E. Rowe. 
See Ref. 6. 
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Fig. 20 — Calculated spectra of timing noise for several durations of rectangular 


pulses. 
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Fig. 21 — Calculated spectra of timing noise for rectangular pulses (n = 1 only). 
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Fig. 22 — Spectrum of phase noise for rectangular pulse. 


a single time slot and so the relations (108), (109), (110) hold. Here 


FL.) = F,(@’) = 1 — cos (22t/T) (115) 
and 
Serf. +f) _ _2 exp (Bava) oxp [avin — Nsine@+2) Gig 
Sf.) 7 (n + 2)[@ + 2)’ — 1] 


The proper value of v is 1/4. 

The spectrum of timing noise is shown in Fig. 35, where it is seen 
that there is no noise energy at zero frequency. The pulse spectrum 
not only has nulls at all the pulse rate harmonics except the funda- 
mental, but has very small energy in all the high frequencies. Thus, 
in this case, the high frequencies have very little effect on the timing 
noise. This was verified by both further calculation and measurement. 

For z not too large, the noise spectrum is nearly 


wolf) we (F298 Oa rad/Hz. (117) 
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2.6.5.3 1.5T Raised Cosine Pulse. This pulse shape is drawn in Fig. 
23a. A single pulse is described by 
f® = 1-—cos(4at/3T). (118) 
Consideration of the specification (54) shows that 


Ft) — Ft) = (0) — 1 + im g<verp 


F(t’) = fl’) + fv + T) (119) 
Fi’) — F(t’) = 7(¢) T/2<0 <a, 
F(t’) = fv’) 


Thus, even though there is overlapping of adjacent pulses, 8, = 0 
and the pulse spectrum is completely specified by S,,. Consideration 
of the waveform of Fig. 23(a) for all pulses present shows that v = 1/2 
is the proper value here. S,,/Sy is independent of v for this value and is 


Dac. : » 9 [1 — exp (—jrm/N)| 
Sy OX Gee)" 8 G/N (4/9 — m2 /N*) 


4% [1 + exp (—jrm/N)] + : exp Ginm/n)}- (120) 





(b) 


0 a 2T 3T TIME,t —> 


Fig. 23 — Pulse waveforms. 
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The spectra of timing noise, caused by this pulse shape, calculated 
and measured, are plotted in Fig. 36. While the magnitude (rms) of 
this noise is about 4 times that for 1T pulses, the energy at zero fre- 
quency is 0.00075 degree in a 10 Hz band, which is quite small. Hence 
overlapping of pulses does not appear to be a source of very low fre- 
quency timing noise. As in the previous case, the high frequency part 
of the response is small and so has only a small effect on the timing 
noise. 


2.6.5.4 Asymmetrical Overlapping Pulses. The particular pulse shape 
chosen here is pictured in Fig. 23b, where it is seen that the rise occurs 
in one-half pulse period while the decline takes a whole period. The 
auxiliary function F.(2) is described by 


Fit’) = 1 — cos 2rt’/T 0<’ <T/2 
= 1 — cosr(t’ + T/2)/T T/2< < 37/2. (121) 


From this and F3 and F4, it is found that the 8 function is zero as 
in the previous case. The spectrum function S,, is 


Sn _ 23 exp (j2rav) exp [j2m(n — 1)] 
Sy (1 — j4/3m)(n + 2) 


1! + exp [—j2r(m + a)] _ 1+ exp [jn + a}. (122) 
1 — 4(n + 2)’ 1—(+2) 


Estimates indicate that v = 0.35 will bring the zero crossing of the 
fundamental term very close to the sampling time of t = 0 and so 
this value was used in the calculations. The spectrum of timing noise 
caused by this pulse shape is plotted in Fig. 36, where it is seen that 
the amount at zero frequency is quite small, although somewhat 
larger than that of the symmetrical overlapping pulses. Hence, asym- 
metry of pulse shape does not appear to be a significant factor in 
very low frequency timing noise. 

Since no readily available means for generating this pulse shape 
in the laboratory was found, there is no measured data. The good 
agreement between calculation and measurement in the other cases 
gives considerable weight to the calculated curve. 


2.6.5.5 Rectified 2T Raised Cosine Pulses. Rectified 27 raised 
cosine pulses are pictured in Figs. 17 and 18. Before rectification, a 
single pulse occupying two time slots is represented by 
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fo(t) = 1 + cos (w,/2)t -T<t<T (123) 


where f, = w,/2m is the pulse rate. After rectification, the auxiliary 
functions are shown in Fig. 17c and described by 


F(t’) = 0, 0s? ST/2 
= —cos (rl’/T), T/2S3 0S 7 

F(t’) = cos (rt'/T), 0s? sT/2 (124) 
= 0, T2= 7 ST 

Fv’) = 1, CH ST. 


Because of the nonlinearity, the § function is not zero. The pulse 
spectrum is described by 


1 exp [—j87/2)(m/N)] cos (rm/N) sin (rm/N) 
(m/N)(1 — 4m*/N*) 


1 exp [—j@7/2)(m/N)][sin (rm/N) — 2m/N] 
T (m/N)(1 — 4m?/N*) 


Sy =0, dy = j2/3r, (Cy) = j/6m for » = 1/4, p = 1/2. 


The spectrum of timing noise calculated from (125) is plotted in 
Fig. 37. The sums in (107) were carried to 15 terms, but the higher 
order terms added very little. The results are very close to those for 
n = 1 except at zero frequency. The higher frequency terms with alias- 
ing do generate some noise at these. The amount, which is difficult to 
see in the Fig. 37, is 0.0023 degree rms in a 10 Hz band, about three 
times that for 1.57’ pulses and one-half that for the asymmetrical 
pulses. 


Sa = 


3 


(125) 


bm = 


2.6.6 Relation of the General Theory to the Simpler One 


The operation U; — U* which appears in (92) for the calculation of 
phase deviation is essentially the same as the operation used in Section 
2.2 for determining symmetrical and antisymmetrical components or 
in phase and quadrature components. The expression U, — U* is 


U, — UE = (8i/(Cy)av)(V/Vn) — (S-/(Cw)av)*(Y-/¥n)*. (126) 


In the simpler derivations of Section 2.2, it was assumed that the 
pulse train was such that (Si /(Cyw)ay) = (S—/(Cw)ay)* and so could 
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be factored out leaving 


U, — UE = (S/{Cw)w)[¥s/Yu — (Y-/Yy)*]. (127) 


Or, in other words, the pulse train contributed to the phase deviation 
in magnitude only. The part within the brackets is the conversion 
factor caused by mistuning, if any, in the tuned circuit described by 
Y. In the case of the offset trigger, a separate conversion factor was 
derived. 

The simple theory cannot be applied generally for the wider pulses 
and particularly in those cases when the § function enters into the 
pulse spectrum description. In some cases considered in detail, it was 
found that the strength of noise components o, (and hence the dis- 
symmetry between side frequency pairs about the pulse rate) depends 
largely on the $,, part of (79), and very little on the S,, part, while the 
situation is just the reverse for the phase deviation ¢, in (92). 


III. SYSTEM USED FOR MEASUREMENTS 


3.1 Principal Apparatus 

In Fig. 24, the connection diagram of Fig. 1 has been revised to 
show the detection and remodulation process. This also shows why 
the actual apparatus used (bottom diagram) is really parts of two 
repeaters. 

The complete block diagram of the apparatus used, corresponding 
to the simplified diagram at the bottom of Fig. 24 is shown in Fig. 25. 
The functioning of this apparatus will now be described in more 
detail. The principal sections are: 


3.1.1 Pulse Regenerator 

The pulse regenerator has been especially developed so that it will 
not add any timing noise of its own. It was worked out mainly by 
C. R. Crue following plans made by S. L. Freeny. To make the first 
record (simulating repeater 1), the regenerator input is switched to 
the source of clipped random noise, thus generating a random train 
of pulses. Thereafter it is switched to the recorder playback so that 
the same sequence of pulses, though random, is used for each trans- 
mission through the apparatus. A fixed bias may also be connected 
to the regenerator input so that it sends an all pulses present train 
to the system for calibration and testing. 


TIMING NOISE 591 


REGEN- 
ERATOR 







REGEN~ 
ERATOR 











PHASE 
MODU—-— 
LATOR 












TR3 


REGEN- 
ERATOR 










PLAY- 


BACK STANDARD RECORD 


REGEN- 
ERATOR 





TR2 TR4 

















ANALYSIS CALIBRATE 


Fig, 24— Block diagram of simulation of chain of regenerative repeaters. Part 
within dashed section at top is redrawn at bottom. 


3.1.2 LC Tank Circwit 

The LC tank circuit which derives the timing wave from the in- 
coming pulse train uses an air core coil and positive feedback to 
achieve a Q of 100. It is necessary to use an air core coil and to keep 
it in a temperature controlled oven in order to measure phase with 
the required precision of less than 0.1°. 


3.1.3 Amplitude Limiter 

A very important section and one which is difficult to achieve is 
the amplitude limiter which removes very nearly all the amplitude 
variation from the timing wave so that in the detection process, only 
the phase deviation of the timing wave is measured. For most of the 
measurements, the limiter has two stages, each consisting of amplifier, 
cathode follower, and series limiter made up of resistance and a pair 
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of diodes. The amplifiers were made to approach linearity very 
closely and are isolated from the limiters by cathode followers in 
order to make amplitude to phase conversion negligible. 


3.1.4 Phase Detector Characteristic 

The phase detector characteristic, volts versus phase, extends 
through zero equally in both directions, and is linear over a wide 
range. How it works is explained briefly in the simplified diagram 
of Fig. 26. There are two inputs to the phase detector, the signal 
wave and the reference standard. At the positive zero crossings of 
these waves, sharp pulses are generated to operate the flip-flop whose 
output is the square wave at # when A and B are opposite in phase 
as shown. The edge of the square wave at EH, controlled by the refer- 
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Fig. 26 — Block diagram and operation of phase detector. 
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ence standard, is fixed; the other edge varies as the phase of the 
signal wave. The average value of EZ appearing at the low-pass filter 
output is then proportional to the time variations of the positive zero 
crossings of the signal wave, that is, its phase. The low-pass filter 
and a little shaping in the de amplifier are such that the bandwidth 
available for the detector output as indicated in Fig. 26 is about 0.4 
the pulse rate. The noise wave thus derived is nearly the best con- 
tinuous representation of the zero crossing deviations. 

Since part of this detector is like a sampler operating at 1 kHz, 
there will be aliasing in the process if the original time variations 
contain frequency components higher than 0.5 kHz, as discussed in 
Section 2.6.4 and 2.6.5. 

With the time interval unit and counter connected at EH, the time 
duration from a fixed edge of the square wave to the adjacent vari- 
able edge can be measured so that individual zero crossing variations 
as well as the smoothed wave v may be obtained from the detector; 
or the time interval unit may be operated to measure from one vari- 
able edge to the next. In this way “spacing jitter” may be measured. 

This connection is also used in the standardization of the detector 
characteristic. With an all pulses present train applied to the tuned 
circuit, a steady sine wave is obtained at the signal point A of the 
phase detector. The precision phase shifter which supplies the refer- 
ence at point B is adjusted until the duration of the positive part of 
EF is 500.0 microseconds. Then a bias adjustment is made to bring 
v to zero volts. After this the phase shifter is moved by various 
amounts from this reference condition and the amplifier gain ad- 
justed to give the proper detector output voltage. In this calibration 
a Leeds and Northrup potentiometer is used for voltage measurement. 


3.1.5 Phase Modulator 

The phase modulator generates a sharp pulse at the instant voltage 
coincidence occurs between the input signal wave and a very “linear 
sawtooth” wave generated from the 1 kHz standard. This pulse is 
then used in the regenerator for timing the new pulses. The modulator 
must generate this timing pulse just as precisely as the phase detector 
detects the zero crossings of the original timing wave. It is calibrated 
by applying a known voltage which then causes a phase shift through- 
out the system. This phase shift is then converted to a voltage by 
the phase detector and the result compared with the input. The modu- 
lator sensitivity is adjusted so that the detector output is the same 
as the modulator input. Thus all calibration of the detection and 
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modulation process is in terms of two absolute standards, the Leeds 
and Northrup potentiometer for voltages and the phase shifter for 
phase angles. 

The circuit for the modulator was originally developed and worked 
out by L. R. Wrathall. This was revised somewhat and built in final 
form and thoroughly tested by C. R. Crue. 


3.1.6 J'ape Recorder 

The tape recorder used is an Ampex FR-100B with servo speed 
control such that reproduced signal time never varies by more than 
+0.25 ms (1 ms = 1 pulse period) from precise time. Because the 
noise wave is essentially a low frequency signal with most of its 
energy below 5 Hz, and because frequency modulation (FM) record- 
ing is used, these small variations in time will affect only the time 
at which this old noise is added to the new. Tests have demonstrated 
that whatever time variation there is has a negligible effect. FM 
recording at 30 inches per second is used on five tracks, one of which 
is for speed servo. The machine is operated at 50 Hz derived by 
step-down chain from the 1 kHz standard which in turn is derived 
from the 1 MHz crystal oscillator in the counter. Two tracks pro- 
vide a 5 kHz band for the pulse train. The other two tracks are for 
the phase noise signal and the bandwidth for these has been reduced 
to 1.25 kHz to lessen recorder noise. The gain through these channels 
is made unity. A phase shifter in the 50 Hz supply to the recorder 
makes it possible to align the recorded pulses with the gating triggers 
at the regenerator input each time the recorder is started. 


3.1.7 Delay Network 

The delay network in the pulse path makes up for the delays in 
the noise path caused mainly by the detection and recording process 
so that the noise record and pulse record correspond at any time. 


3.1.8 Trigger Circuit and Envelope Detector 

A trigger circuit with adjustable triggering level was introduced 
into the system in place of the limiter for the investigation of ampli- 
tude to phase conversion effects. Operation and data taking were 
simplified by removing the limiter entirely even though in a real 
repeater some limiting, though imperfect, would be used. 

For the measurement of the properties of amplitude variations of 
the tuned circuit response, an envelope detector was connected at the 
tuned circuit output. 
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3.1.9 Pulse Shaping Networks 

For the study of wide pulses and intersymbol interference, pulse 
shaping networks were introduced between the pulse regenerator and 
the tank circuit. These were RC circuits and low-pass filters for 
producing a sine-squared shape. In some cases a half-wave rectifier 
was introduced between filter and tank circuit. 


3.1.10 Longword Pulse Pattern 

In order to investigate some aspects of timing noise, an attachment 
to the signal generator was made so that a longword pulse pattern 
with a period of 240 bits, and hence having important components 
within the band of the tank circuit, could be applied to the system. 
The basic parts of this attachment are (2) a code plate with a rectan- 
gular 15 by 16 array of holes, placed before (w) a cathode ray tube, 
the electron beam of which is swept across all the holes successively, 
and (wt) a light sensitive device to convert the light coming through 
the code plate holes into pulses. The longword signal pattern which 
is desired is then brought about by blocking out with black tape the 
proper holes in the code plate. This apparatus was developed by C. R. 
Crue. 


3.2 Method of Operating the System 

In using the system to obtain a series of noise records corresponding 
to the timing noise at successive repeaters in a chain, the procedure 
is as follows. After calibrations have been made with the all pulses 
present condition, the regenerator is connected to the clipped noise 
source (which has been adjusted to give the desired pulse density) 
and recordings are made of the pulse train on track 5, the generated 
phase noise in the timing wave on track 3, and the 50 Hz servo control 
wave on track 1, for about 15 minutes. Then the tape is rereeled, 
calibration checked, and playback started with track 3 going to the 
phase modulator and track 5 going to the pulse regenerator. This 
time the pulse train is recorded on track 4 and the timing noise on 
track 2. We now have two timing noise records corresponding to the 
timing waves at the first and second repeaters of a chain and these 
are now analyzed following the procedure outlined below. 

Notice that it is the phase noise on the timing wave which is 
analyzed here rather than the repeater output pulses. These are the 
same, of course, in a completely retimed repeater. Therefore, it is 
not necessary that the recorded pulse train have the accumulated 
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timing noise; it is a perfectly timed replica of the original pulse 
train. The first part of the regenerator removes any time variations 
acquired in the recording-playback process. In the first recording, 
a few minutes of all pulses present is recorded before switching to 
the random train to give the recorder servo time to synchronize and 
also to allow time for the alignment of recorder to system each time 
it is started. 

To obtain the next pair of records corresponding to transmission 
through repeaters 3 and 4 of the chain, track 2 is played back to the 
phase modulator, while the detected accumulated noise is recorded 
on track 3 thus erasing the record first made there. Then track 3 is 
played back to the modulator with the new noise record being made 
on track 2. And so on, as long as desired or until some difficulty in 
the process arises. 

During the playback, the alignment of system and recorder is 
monitored continuously to make sure that the recorder stays in syn- 
chronism. If it does not, the pulse train may be altered. 

A great amount of time and effort has gone into the building of 
the system just described to make it sufficiently stable and accurate 
for measuring phase deviations to within less than 0.1° out of 10°. 
It is necessary to measure with this precision in order to be able to 
describe accurately the change in noise from one repeater to the next 
because of the small amounts involved. The rms value of noise gen- 
erated at one repeater with Q = 100 and 0.1 percent detuning is a 
little less than 1°. 

Another factor in the reliability of the data is that a long enough 
signal was used for analyzing so that fluctuations in the plotted 
parameters of the noise were fairly small. As described below, 32 
ten-second averages of the time the noise wave spends below each 
threshold are used for each point on the cumulative distributions. 
Since the counter rests for 10 seconds after each adding period, the 
length of signal involved is 640,000 pulse positions. Each plotted 
point is well enough established, so that the curve connecting them 
is smooth without the necessity of further averaging. 

The residual noise at the detector output for all pulses present 
is about 0.006° rms. For a random train of pulses (narrow) there 
should be no phase noise generated if the tank circuit is centered 
exactly on the pulse rate. In this situation, the residual noise is about 
0.012° rms. See Fig. 28. This is not only a good test of the system 
as a whole, but is a good dynamic test of the limiter, which is hard 
to do in any other way. 
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3.3 Apparatus for and Process of Analyzing Data 


3.3.1 Cumulative Distribution-Slicer Circutt 


To obtain the cumulative distribution of the noise, a slicer circuit 
was developed. This is an adjustable threshold device which generates 
a standard height pulse whenever the noise wave is below the thres- 
hold. The duration of each of these pulses is measured by counting 
the number of cycles of a 100 kHz wave which the pulse gates to 
the counter. The accumulated durations of all these pulses which 
occur in a standard interval of 10 seconds is then totaled by the 
counter. 

At each threshold setting, 32 of these totals are obtained and 
plotted in control chart fashion as shown in Fig. 27. This helps us 
to see if the data are statistically acceptable. If appreciable trouble 
has occurred in the apparatus during the run, it will show up in this 
picture. The median taken from each chart of data is used to plot 
one point on the distribution. After the distribution is plotted, the 
rms value is taken from 1/3 the difference of the values at the 93.3 
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Fig. 27— Control chart plot of data to obtain one point of cumulative dis- 
tribution (as in Fig. 31) of timing noise amplitudes. 
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percent and 6.7 percent points, following a method suggested by E. B. 
Ferrell for skewed distributions. 


3.3.2 Spectrum Density—W ave Analyzer 

To obtain the spectrum density of the noise wave, a General Radio 
1900A wave analyzer with external rms indicator is used. In order 
to extend the nominal 20 Hz lower end of the analyzer to around 1 
Hz, a 16-to-1 speedup of the noise wave is made in a second recording 
process. Each point on the cumulative distribution requires about 11 
minutes of signal, so the original noise record is played back and 
then rereeled for each point. During each playback the noise record 
is duplicated at lower speed by bridging the input of another FR-100 
recorder operating at 1% inches per second at the main recorder out- 
put. The new record then consists of about 8 to 10 serial duplications 
of the noise which, when played back at 30 inches per second, pro- 
vides about 5 minutes total of the original noise with all frequency 
components multiplied by 16. A wave analyzer band of 10 Hz then 
is equivalent to a 0.625 Hz band for the original noise. 

In order to obtain consistent and reliable measurements of noise 
power in the wave analyzer band, it was found necessary to replace 
the linear detector provided in the wave analyzer with a square law 
detector. To do this, the wave analyzer IF output was connected to a 
Ballantine rms meter and the square law response of this applied 
to a 4-second time constant RC smoothing circuit and a linear scale 
meter. The pointer of this meter still fluctuates, thus requiring some 
kind of average. This average was obtained arithmetically from 20 
successive meter readings made at 5-second intervals and, after cali- 
bration, was taken as the measure of the mean square power of the 
noise falling within the wave analyzer band at each frequency setting. 

The variance of each of these readings depends partly on the 
analyzing filter bandwidths and partly on the lengths of signal record 
available. The latter part could be obtained only by calculation and 
appears to be the dominating part. Variance estimated this way 
indicates that any reading taken by the above method is within 10 
percent of the true value with 95 percent confidence. The data taken 
appear to have less variation than this. Some further smoothing of 
an unknown amount occurs in the plotting of the individual points 
to give a smooth spectrum curve. Integration of these curves con- 
sistently gave results agreeing (within a very few percent) with 
measurements of total noise power. 
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IV. RESULTS OF EXPERIMENTS 


4.1 Use of Narrow Rectangular Pulses 


The first measurements of phase noise on the timing wave recovered 
from a random train of narrow pulses by a tank circuit mistuned from 
the pulse rate showed that the magnitude of the noise is proportional 
to the amount of mistuning when this is only a few tenths of a per- 
cent, as predicted by W. R. Bennett.1 Hence there should be no noise 
for zero mistuning. It was found difficult to define, though, just what 
zero mistuning is for a real LC tank circuit. In the experiments 
reported here, the tank circuit consisted of an air core coil and capaci- 
tor in shunt at the collector of a transistor, the emitter of which had 
a resistor equal to the resonant impedance of the LC tank. The 
transistor was driven at its base and there was positive feedback 
using another transistor to bring the effective Q of the tank to a 
value of 100. The reference point from which mistuning was measured, 
chosen because it could be easily set with sufficient precision, was 
that of 180° phase between the collector and emitter voltages of the 
tank transistor as determined by an oscilloscope Lissajou figure. 

That the minimum of phase noise does not occur at this point is 
shown by the curve of Fig. 28. Neither is its value zero, being made 
up partly of the residual noise of the system as indicated by APP 
(for all pulses present) and partly of the noise from the train of 
random narrow rectangular pulses. However, in the investigation of 
noise caused by pulse shape alone (Section 4.8), it was found that 
this minimum does not coincide with zero mistuning. Rather, the 
minimum is the result of cancellation by small amounts of mistuning 
and trigger offset of part of the noise attributable to pulse shape. 
While the minimum is about 0.012 degree, the noise from the pulses 
is about 0.086 degree. This is the true zero mistuning and occurs 
about where the curve crosses this ordinate. Even though the noise 
contributed by the narrow rectangular pulses is not zero, it is quite 
small as may be seen from the spectrum curves of Figs. 4 and 20. 
Hence using the narrow rectangular pulses in the measuring of noise 
caused by tank circuit mistuning and by offset trigger gives results 
which very nearly isolates these as sources for individual scrutiny. 


4.2 Tank Circuit Mistuning 

The curves of Fig. 2 show how the spectrum changes as the noise 
is examined at successive repeaters, each mistuned by the same amount 
and direction, in a chain of six repeaters. Distinctive features of these 
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Fig. 28— Measured timing noise as a function of tank circuit detuning near 
minimum. This shows residual noise of the system 


spectra are the zeros at zero frequency and the maxima which occurs 
at f = f,/2Q for N = 1 and closer to zero as N increases. Additional 
measurements on longer chains up to 20 in length show that this trend 
continues and agrees with magnitudes predicted by the theory pre- 
sented in Section II. In Fig. 14, these predictions are plotted and 
extended to a chain of 100 repeaters. It is evident from these results 
that while the maximum continues to rise, its magnitude will never 
be greater than four times what it was at the first repeater. The 
reason for this is that the noise spectrum generated at each repeater 
is zero at zero frequency and the succeeding tank circuits continually 
attenuate the higher frequency components. These effects are seen 
also in the data on the total phase noise along the chain. For chains 
up to 20 in length, measured and calculated values are plotted in 
Fig. 29. Calculated values for longer chains are plotted in Fig. 15. 

Figure 30 shows the measured and calculated spectra of phase 
noise at two adjacent repeaters when the second one is mistuned in 
the opposite direction but by the same amount as the first one. This 
has the effect of reversing the dissymmetry of side frequencies about 
the carrier in the second repeater, and so there is a partial canceling 
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_ Fig. 29— Total timing noise caused by 0.1 percent mistuning of timing tanks 
in chains of N like regenerative repeaters. 


of phase noise. Thus the greatest accumulation of timing noise caused 
by mistuning comes when all repeaters in a chain are mistuned the 
same way. 

In all these cases, we have seen how well the values of timing noise 
calculated from the theoretical model of Section II agree with those 
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Fig. 30—Spectra of timing noise at two successive regenerative repeaters 
with oppositely detuned timing tanks. 
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obtained by measurement. The theoretical model was developed from 
the simple picture of the equivalence of phase modulation of the 
recovered carrier and dissymmetry of the side frequencies about this 
carrier introduced by the mistuning of the tank circuit into the other- 
wise symmetrical side frequency and carrier representation of the 
random pulse train. 

The cumulative distribution curve of phase noise generated at a 
mistuned repeater, obtained from measurements, is shown in Fig. 31 
and is seen to have a shape which is approximately log-normal. Cal- 
culation of a distribution curve which agrees quite well with these 
results has been made by M. R. Aaron and J. R. Gray.® When the 
distributions of timing noise at successive repeaters along a chain 
are examined, it is found that the skewness is gradually reduced. 

Another set of data from the measurements of timing noise caused 
by mistuning is that concerning spacing noise which is displayed in 
Fig. 32. Spacing noise is defined as the deviations from normal of the 
spacing between successive positive (or negative) going zero cross- 
ings of the timing wave. As suggested by M. R. Aaron and H. E. 
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Fig. 31— Cumulative distribution of timing noise amplitudes obtained from 
measurements (noise caused by tank detuning). 
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Fig. 82— Timing wave spacing noise caused by tank mistuning. Measured 
deviations of zero-crossing differences from 1.0 millisecond. 


Rowe, the reason for this lies in the quantized character of changes 
in the pulse pattern. That is, at any point in time, the next time slot 
has either a pulse or no pulse. The magnitudes can be found from 
the results shown in Fig. 39a for one absent pulse in 240 and mis- 
tuning. Scaling this to 0.1 percent mistuning (condition for Fig. 32), 
gives a peak phase change of 0.36° or 1 ys. For a single pulse added, 
the phase change would have the opposite sign. Since the most likely 
change in the random pulse train is that of a single pulse added or 
left out and other changes are less likely, the distribution of Fig. 32 
should have peaks near +1 ys and —1 ps as observed. 


4.3 Amplitude-to-Phase Conversion 

The pulse rate fundamental recovered from the signal pulse train 
by narrowband tank circuit has appreciable phase deviations only 
if the tank circuit is mistuned from the pulse rate. But this funda- 
mental has noisy amplitude variations even when the tank circuit is 
perfectly tuned. 

By connecting an envelope detector at the tank circuit output, 
recordings were made of the response amplitude variations. The spec- 
trum is shown in Fig. 33 where it is seen to have a nonzero value as 
frequency approaches zero as predicted by the calculated values 
superimposed and as also shown by the calculated curve of Fig. 8. 
That this must be so may be seen by considering that the noise side 
frequencies continue to exist at about the same magnitude as they 
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get closer to the carrier (pulse rate). That is, the amplitude modula- 
tion is not reduced, only the dissymmetry disappears. The measured 
cumulative distribution of the amplitude variations is very nearly 
normal. 

One way in which these amplitude variations may be converted 
into phase variations is through an imperfection in the trigger circuit 
which, from the timing wave, generates sharp pulses for retiming the 
signal pulses. Such will be the case if the triggering level of this cir- 
cuit is, for some reason, offset from the zeros of the timing wave. 

This kind of timing noise was generated in the system for simulat- 
ing a chain of regenerative repeaters by replacing the amplitude 
limiter which follows the tank circuit in Fig. 9 with a trigger circuit 
having an adjustable threshold. In a real repeater, some amplitude 
limiting, though imperfect, would be used between tank and trigger, 
but here it is more convenient to leave out all limiting. 

Spectra of this amplitude to phase timing noise, when there is no 
mistuning, at successive-like repeaters in a chain of six are plotted 
in Fig. 3 along with calculated points. The spectra have the same 
shape as that of the amplitude variations since the two phenomena 
are directly related. The rms magnitudes at very low frequencies 
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Fig. 33 — Spectrum of amplitude variation of tuned circuit response to random 
unipolar rectangular pulses. zs 
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grow directly with repeater number because of the nonzero magnitude 
of noise and nearly zero tank circuit attenuation at very low fre- 
quencies. Total timing noise in the same situation is shown in Fig. 
34. The cumulative distribution of the noise is somewhat curved, 
about like the amplitude to phase conversion characteristic of the 
trigger circuit. 

Timing noise was measured also when mistuning and amplitude to 
phase conversion were both present in the same repeater. This is 
shown in Fig. 10, along with calculated values. 

We see in all these results very good agreement between the mea- 
sured values and those calculated, as outlined in Section II, by the 
method developed first in the investigation of noise caused by mis- 
tuning. 


4.4 Phase Noise Attributable to Pulse Form 


4.4.1 Rectangular Pulses 

After it was found that the low sharp minimum of timing noise 
shown in the curve of Fig. 28 was attained by small deviations from 
zero mistuning and zero trigger offset, fairly good agreement between 
measured and calculated noise spectra for rectangular pulses was 
obtained. Most of the curves of Fig. 4 are from measurements while 
those of Figs. 20 and 21 are from calculations. 
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Fig. 34— Total timing noise from amplitude variation of timing wave and 
offset trigger as a function of the number (N) of repeaters in a chain, 
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It is only the narrow rectangular pulses which have the magnitude 
and shape of noise spectrum which can be changed to produce the 
result of Fig. 28 by small adjustments of the two indicated factors. 
The calculated curves of Fig. 22 show how this can be. For rectangular 
pulses with 7/T’ = 0.6, the total noise changes only about 10 percent 
as the tuning varies over the range of Fig. 22. 

An attempt was made to measure the effect of high frequencies in 
the tank circuit response by cutting out these components; it was not 
entirely satisfactory because of the difficulty of obtaining a suitable 
filter. The results were sufficient though to verify the general features 
of the differences between the curves of Fig. 20 and Fig. 21. 


4.4.2 Raised Cosine Pulses 

Trains of pulses approximating the raised-cosine shape were gen- 
erated by applying the 10 percent duty factor rectangular pulses to 
a 4-section low-pass filter built from a design by W. E. Thomson.” 
Filters were built to generate pulses 17, 1.57, and 27 wide at their 
bases. Oscilloscope presentations of the 17 pulses are shown in Fig. 
18c and of the 2T pulses in Fig. 18a. 

The measured spectrum of phase noise caused by the 17 pulses is 
shown in Fig. 35 along with the calculated values and it is seen that 
agreement is fairly good. This spectrum has appreciable energy at 
considerably higher frequencies than does that caused by mistuning 
or trigger offset and narrow rectangular pulses. The reason for this is 
that the dissymmetry of side frequencies extends to much higher 
frequencies. The same is true of course for the rectangular pulses 
wider than 7/T = 0.5. 

Measured and calculated spectra for the 1.57 pulses are plotted 
in Fig. 36 where it may be seen that the total amount of noise is about 
four times as great as that for the 17 pulses. 

Since there seemed to be no suitable way to generate the asymmet- 
rical pulses, no experimental data is available for this case. 

Some of the data obtained with rectified 27 pulses are presented 
in Fig. 37. It is difficult to duplicate experimentally the idealized 
waveform of Fig. 17b assumed in the calculations of this case. For 
example the wiggles at the top of the real waveform Fig. 18a come 
from small departures from ideal of the pulse shapes, and the rectifier 
does not produce cusps at its cutoff point but rounded transitions as 
in Fig. 18b. The data plotted in Fig. 37 agrees fairly well with cal- 
culation, and it is believed to be reliable. But other data has been 
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Fig. 35 — Spectrum of timing noise from raised cosine random pulses one time 
slot wide at base. No tank mistuning. 


obtained which gives spectra both smaller and larger than that shown. 

When the driving pulses are ac coupled to the shaping filter, there 
is a large increase in very low-frequency noise components as shown. 
This illustrates the statement made before that if there is low-fre- 
quency distortion in the transmission of pulses, then nonlinearity 
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Fig. 36-— Timing noise spectra for 15 7 and asymmetrical raised cosine 
pulses. 
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Fig. 37—Spectrum of timing noise from rectified raised cosine puses two 
time slots wide. No tank mistuning. 


which follows this can convert this distortion into very low-frequency 
phase noise. With ac coupling, the pulse waveform of Fig. 18(a) is 
changed to that of Fig. 38. 


4.5 Long Word Periodic Pulse Pattern 

In addition to the previous results obtained with a random pulse 
train, a few measurements were made using a periodic pulse pattern 
of period 240 time slots. 





Fig. 38— Photograph of random pulse oscilloscope traces for raised cosine 
pulses two time slots wide at base. AC coupling. 
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For a particular pattern chosen at random, the measured phase 
modulation on the recovered pulse rate agrees very closely with that 
calculated from the theory developed in Section II. Hence there is 
nothing in periodic pulse patterns as such to cause behavior different 
from that predicted by the theory. 

Another particular pattern, with one pulse per period missing was 
used to measure a sort of phase impulse response. Under this condi- 
tion phase deviation was observed and photographed under two con- 
ditions. Figure 39a shows the phase detector response for tank 
mistuning and Fig. 39b that for an offset trigger circuit following 
a perfectly tuned tank. In Fig. 39a, the mistuning is —0.2 percent 
and the peak phase deviation 0.73 degree. In Fig. 39b, the trigger 
offset is 12.6 degrees and the peak phase deviation is 0.438 degree. 
The wiggles on both waveforms are a residual noise in the system 
and have nothing to do with the phenomena being discussed. 





Fig. 39— Photographs of oscilloscope traces—Phase deviation of recovered 
fundamental pulse rate for pattern of 239 pulses, 1 space, with (a) tank circuit 
mistuning, (b) amplitude to phase conversion by means of trigger offset. 
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Both of these resemble the response of an RC circuit to an impulse. 
A simple picture of this situation then is that deletion of one pulse 
from the all pulses present condition is converted, by one or the 
other of the two imperfections considered, into an equivalent impulse 
of phase change which through the tank phase transfer characteristic 
produces one of the responses of Fig. 39. 

But there is a small, though significant, difference between the 
two responses. The simpler of the two, Fig. 39b, with its sharp 
rise and exponential decline is very close to an RC impulse response. 
Its measured time constant is about 30 ms which is quite close to the 
value of 32 ms which describes the phase transfer characteristic 
derived from sine-wave measurements. The Laplace transform of this 
pulse is F(p) = 1/(p + a). Its amplitude spectrum, with its finite 
value at zero frequency, is like the measured spectra of phase noise 
caused by amplitude to phase conversion. 

The pulse in Fig. 39a differs from the other, principally in its 
crossing, of the baseline and undershooting as it declines to zero. The 
observed pulse can be approximated very closely by the modified 
exponential f,(¢) = e-@‘(1 — at) which has the Laplace transform 


FO) = a 


The corresponding amplitude spectrum is 
oy) | = el. 
| F (jw) | gtd age 


It is seen that f,(¢) goes through zero att = t = 1/a and has a minimum 
value of —e™* = —0.135 at t — =2t . Also, it is seen that | F(jw) | 
has its maximum at w = a. From the photograph, we estimate that 
t) = 33 ms and that the minimum of 0.135 occurs at 2t, . Further, from 
the earlier work we find that both measurement and theory show a 
maximum in the spectrum of phase noise caused by tank circuit mis- 
tuning at w = w/2Q. Equating this to a gives 1/a = 32 ms which is 
very good agreement with the estimation of t . 

Thus the “impulse response”’ view of phase deviation in the funda- 
mental recovered by means of a narrowband tank circuit is consistent 
with the spectrum modification view worked out earlier. 


Vv. CONCLUSIONS 


Two important sources of timing noise in a self-timed regenerative 
PCM repeater, namely tank circuit mistuning and amplitude to phase 
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conversion by means of an offset trigger circuit, have been identified 
and studied both experimentally and theoretically. Another source, 
pulse shape and duration has been studied less than the other two. 
Another subject investigated is the way these noises accumulate 
along a chain of repeaters and how they combine. 

As a part of this investigation, a simple theoretical picture of the 
process has been developed whereby many important properties of 
timing noise and its accumulation along a chain of repeaters can be 
calculated with results which agree very well with corresponding 
measurements. 

The process of calculation begins with the spectrum of the pulse 
train, then proceeds to the modifications of it which arise as the 
pulse train is transmitted through the timing tank and retiming 
process of each repeater. The properties of the timing noise which 
appear on the retiming wave depend on the particular character of 
these modifications. The further modifications of the spectrum of the 
pulse train, acquired at each repeater, are used to find the way in 
which the timing noise accumulates along the repeater chain. 

These investigations have shown that the characters of the spectra 
of these timing noises are important in determining how they accu- 
mulate along a chain of repeaters. It has been found that the spectrum 
of the noise caused by tank circuit mistuning has a zero at zero 
frequency. It has been demonstrated that because of its property, 
the noise increases only for the first few repeaters of the chain, soon 
reaching a limit. Further, it has been found that the spectrum of 
timing noise which depends on amplitude variations of the timing 
wave (as in a trigger circuit where the firing point has been offset 
from a zero crossing) has a nonzero value at zero frequency. It has 
been shown that because of this property, this timing noise increases 
without limit along the repeater chain. The total amount of the noise 
varies inversely with the Q of the tank circuit. 

Thus, whether or not timing noise increases without limit along 
a repeater chain depends, not on whether the same noise is generated 
and added on at each repeater, but on whether or not the spectrum of 
the noise added at each repeater has a nonzero value at zero frequency. 

Study of the effect of pulse shape and duration have shown that 
while the total noise from this source is greater than it is for the 
others, the amount at very low frequencies is quite small, though not 
zero in a number of cases. These latter components are the result 
of aliasing of the high-frequency parts of the tank circuit response. 
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If there is nonlinearity in the fundamental recovery path, low- 
frequency pulse distortion during transmission can be converted to 
very low frequency timing noise. 
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Synchronizing Digital Networks 


By J. R. PIERCE 


(Manuscript received September 11, 1968) 


It appears that stable synchronization of large digital transmission 
networks should be easy, granted accurate clocks, buffers which accept 
pulses at the incoming rate and deliver them at the local clock rate, and 
adequate delay for making frames coincide. An electric network analog of 
a simple linear system in which the clock frequency depends on the fullness 
of buffers and the departure of frequency setting from midsetting makes 1 
obvious that the system is stable. System frequency should be made to depend 
strongly on accurate or master clocks; criteria are given for choosing param- 
elers to achieve this. Strategies are given for periodic infrequent adjustments 
to compensate for changes in transmission time, and for adding new clocks 
to the network. The practical realization of a synchronized network calls 
for more information concerning variations of transmission time and for 
adequate components, particularly, buffers and adjustable delay devices. 


The synchronization of digital networks has been studied theoret- 
ically and experimentally.** This paper does not purport to review 
excellent previous work, some of which has been highly theoretically 
oriented, through some results of earlier work are referred to. Rather, 
it discusses some problems of synchronization and illustrates them 
by means of a simple analysis of a simple example. We reach the 
following conclusions: that with good clocks, buffers and adequate 
delay both to compensate for changes in transmission time and to 
make frames coincide, there should be no trouble in stably synchroniz- 
ing a nationwide network. This is in accord with earlier analysis 
and experiment.® 


I. SYNCHRONIZING FRAMES OR BITS 


In some papers, synchronization has been discussed in terms of 
synchronizing frames, that is, successive groups of bits identified by 
some framing signal present in each group. ° In this paper, synchro- 
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nization will be discussed in terms of synchronization of bit streams. 

The choice of the bit stream as the signal to be synchronized is 
partly arbitrary. However, there may be advantages in performing, 
at a terminal, as many operations as possible on an accurately timed 
binary bit stream. 

For some purposes, and especially for time division switching, it 
seems essential to synchronize frames. This can be done by passing 
the bit stream through a suitable adjustable delay device. The delay 
measured in pulses, which is needed to make frames coincide, is not 
small. The frame time for a digital system designed for speech trans- 
mission is commonly 1/8000 second. If a transmission system runs 
at the rate of 5 X 10° pulses per second, the frame time includes 
about 70,000 pulses. This may be an akwardly large number of pulses 
to store. 


II. CHANGES IN TRANSMISSION TIME AND CLOCK RATE 


A scheme of synchronization must take into account both errors 
in clock frequency and changes in transmission time. These pose 
rather different problems. Both for this reason, and because it makes 
the presentation simpler, changes in transmission time are disre- 
garded in the earlier portions of this paper, and treated separately 
later on. 


III. THE SYSTEM CONSIDERED 


Various approaches to synchronization are possible. One solution 
would be to transmit synchronizing signals from a central master 
source. Few people seem to like this method because of problems 
of reliability. 

The system considered here is composed of a lot of centers (the 
dots of Fig. 1) with highly stable clocks, interconnected by two-way 
digital circuits (the lines of Fig. 1). The common frequency of opera- 
tion will be determined by the characteristics of the clocks and of the 
transmission circuits. 

We assume that each clock is equipped with a frequency control, 
so that its frequency can be adjusted to be above or below its central, 
“correct” value. We also assume that each receiver is equipped with 
a buffer which will accept a bit stream from a terminal at some 
received rate and emit bits at the rate or frequency determined by 
the local clock. 
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Fig. 1— A network of centers to be synchronized. 


We might assume a system in which each center knew the state 
of every other center’s buffers and clock settings. In Section XI we 
discuss how such knowledge may be used in dealing with changes in 
transmission time. Initially, we will consider the case in which each 
center knows only the state of its buffers and the setting of its clock 
frequency. Thus, any adjustments of the clock frequency will be 
based on the frequency of each clock with respect to its center value, 
and on the state of the buffers, each of which reflects both the clock 
frequency at the other end of a transmission circuit relative to the 
local clock frequency, and any changes in the transmission time. 

The case considered is illustrated schematically in Fig. 2. The 
elements concerned with automatic adjustment of the clock frequency 
are enclosed by a dashed line; they consist of the clock, buffers, and 
a network whose inputs are buffer readings and (optionally) the read- 
ing of the clock setting and whose output is a signal which adjusts the 
frequency setting of the clock. Other elements shown in Fig. 2 are 
adjustable delay for framing the bit stream from the buffer output, 
a decoder for going from received pulses to bit stream, and an adjust- 
able delay before or after the decoder which can be used to compen- 
sate for changes in transmission time. 
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Fig. 2— Block diagram showing components used in synchronization. 


IV. STABILITY OF CLOCKS 


It is clear that both the stability of the clocks and the pulse rate 
are overwhelmingly important. If the clocks were perfectly stable 
(if they could be exactly synchronized at the factory and if they main- 
tained their frequency exactly after being shipped to the centers), 
then the buffers would merely have to take care of fluctuations in 
transmission time. As the transmission time will not change without 
bound, finite, realizable buffers would insure the satisfactory operation 
of the system. 

The clocks cannot be synchronized perfectly, but will differ in rate 
by some fraction, d, which may be 10-* for a good crystal oscillator or 
perhaps as good as 10-? for an atomic clock. Comparatively inexpen- 
sive ($1,800) commercial frequency standards are now available which 
have a short-term d of 10“ and a long-term value of 2 x 10 (see 
Ref. 10). 

Consider two centers interchanging pulses at a rate r per second. If 
no effort is made to synchronize the clocks, the number N of pulses 
the buffer must absorb per day would be roughly N = 86,400 rd. 
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Let us consider some cases: 


r d N 
5 X 107 1077 4.32 X 10° 
5 X 107 2X 10-4 864 
5 X 107 10-2 43.2 


This is instructive. If the clocks were stable enough, the buffers 
could be dumped and the delays readjusted at strategic times, with 
resulting errors through loss of message bits. This might be feasible 
with the Hewlett-Packard clock.° If very stable clocks are used, 
any adjustments can be very slow or very infrequent. 


Vv. TRANSMISSION TIME 


The time delay in transmission is a complication in any analysis 
of synchronization. This makes the idea of infrequent periodic ad- 
justments or very slow continuous adjustments attractive. If the 
time intervals between adjustments or the time constants involved 
in the adjustment process are long compared with the transmission 
time, then a sort of kinetic model, in which transmission time can 
be disregarded, will apply. 

Transcontinental transmission time via coaxial cable is of the order 
of 0.02 second, and via microwave radio somewhat less. For a clock 
stability d = 10-° and a pulse rate 5 X 10®, around five pulses would 
accumulate in the buffers each second, and an adjustment once a 
second seems reasonable. For a d = 107° and the same pulse rate, 
5 pulses would accumulate in 100 seconds, a time long compared 
with the 0.3 second transmission time for a synchronous satellite. 

Thus, it appears from the outset that very slow adjustments of 
clock frequency are permissible. We will see later that very slow 
adjustments are desirable as well. There is every reason to believe 
that we can disregard transmission time in considering the stability 
and performance of the synchronizing scheme which we discuss in 
Section VIII.* 


VI. THE BUFFERS 


We have noticed that at each center we may make use of the 
departure from the center frequency setting of the clock frequency 


* A criterion for stability which is independent of transmission time has been 
available for some years In work to be published, I. W. Sandberg gives a less 
stringent criterion which is dependent on transmission time, and which is in 
accord with the qualitative statements made here. 
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adjustment and of the loading of the buffers, which we call b;, bs, and 
so on. These loadings may, with respect to their design or “normal” 
values, be positive or negative numbers. 

Buffers will have finite capacity and hence will overflow when 
clock frequencies at two interconnected centers are persistently dif- 
ferent. Here it is assumed that when a buffer overflows it remains 
completely full or empty until the sign of the difference in clock 
frequencies changes, at which time it again accepts and release 
pulses. It is also assumed that during the overflow condition the 
buffer reading b remains at some extreme value +b,,, and becomes 
smaller in magnitude once the sign of the difference in clock fre- 
quencies changes. 

The bounds imposed on the b’s by buffer overflow are valuable. 
The clock at the other end of a circuit may be in really bad trouble. 
In the extreme; no pulses may be coming in. Thus, we should not 
allow, or else we should disregard, buffer readings beyond some 
limiting extremes. As the buffers are finite, drastic malfunction will 
cause them to overflow and so limit the range of the buffer reading b. 


VII. CLOCKS OF DIFFERING ACCURACY 


In adjusting clock frequency, account should be taken of the 
accuracy of the clock. In a large system, it may be desirable to use 
very accurate master clocks at some important nodes and less ac- 
curate subsidiary clocks at other nodes. It is essential that some 
provision be made so that the final frequency of operation depends 
most strongly on the accurate master clocks and less on the less 
accurate subsidiary clocks. Thus, in adjusting the clocks, either the 
magnitude of the adjustments should be suitably smaller for the 
more accurate clocks than for the less accurate clocks, or else the 
adjustments of the more accurate clocks should be made less fre- 
quently or both. 


VIII. MATHEMATICAL DESCRIPTION OF THE SYNCHRONIZING SCHEME 

. Let us now consider one particular node of the network of Fig. 1, 
-at which the clock frequency is f;. This node is shown as 1 in Fig. 3. 
Connected to it are nodes 2, 3, 4,..., , where the frequencies are 
fe, fs, fa, fn. Lf the 1, n buffer is set to bing at t = 0, the various buffer 
readings at 1 are 


bin = Dano + fe — 40 at (1) 
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fs 


fa 
fn 


Fig. 3— A node of the network of Fig. 1. 


We will assume that b,, can be positive or negative. If we let 
bin = O when the buffer is half full, then the buffer reading can have 
any value in some range from —b,, to +bm where 2bm is the size of 
the buffer. 

Let the “center” clock frequency at 1 for “center setting” of the 
frequency control be fio. The center settings are intended to adjust 
the clock to the desired system frequency. The frequency fino of a 
given clock at center setting differs from the intended frequency 
because of the error of the clock. 

The buffer reading must be an integer. If the range of variation 
of this integer is great enough, it should be possible to treat the 
buffer reading as a continuous variable in a differential equation; 
this is what we will do. 

A suitable linear strategy for adjusting the frequency f; was found 
to be 


AL | -Wat— Bh - fo) — CB 4 AD dw = 0. 


We can regard the equation as applying either to control of the clock 
frequency or the rate of change of clock frequency. If we make C = 0, 
then the equation prescribes departure from center clock setting, 
(f: — fio), in terms of buffer content. If C ¥ 0, then the equation pre- 
scribes rate of change of clock setting, df,/di, in terms of buffer con- 
tent and departure from center clock setting. 

In either case B, should be larger for more accurate clocks and 
smaller for less accurate clocks. This will give the departure from 
center clock setting greater weight for more accurate clocks and less 
weight for less accurate clocks. 


IX. AN ELECTRIC ANALOG 


Let us now consider an electric analog, the circuit shown in Fig. 4. 
Node 1 is connected to ground through a capacitance C. Current. 
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Fig. 4— Electric analog of a node synchronization system considered. 


flows to C through a resistance R, to which a bias voltage Vio 1s ap- 
plied. Current also flows to node 1 because this node is connected to 
other nodes 2, 3, 4,...,, at which the voltages are V2, V3, Va,..., 
V., by inductances L. The differential equation for the voltage V; is 


w/b) Xf w.- vat 


= W/R)(Vi = Ve) — C+ Cw =0. ©) 


Here I1n9 is a current flowing into node 1, not from but associated with 
node n, at time t = 0. We see that equation (3) is identical with 
equation (2) if we let 


Vi = fn 
Vio = fro 
(1/L) = A w) 
(1/R,) = By 
Iino = ADino 
C=C 


The behavior of the frequency of a network of oscillators adjusted 
according to the strategy of equation (2) will be the same as the 
behavior of the voltage in an L, R, C network of the form shown in 
Fig. 5. We should notice that it is perfectly permissible to make 
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C = 0 in (2) or (8). A network analog for this case has been given 
by Brilliant.® 

In a network such as that of Fig. 5, no matter how extensive or 
how interconnected, all the voltages settle down to some final value 
because of the damping (energy loss) of the resistors R,. What is the 
final voltage? It must be such that the total of the currents flowing 
to all nodes is zero. This means that 


0 = DX ps Tamo + a (Vino — V)/R, (5) 


The double summation is in each case over all nodes. Not all nodes 
are connected to one another, and Inmo will be zero for » = m and for 
all nodes n not connected to m. 

The analogous expression for final frequency is 


O=A De De Damo + 2 Balfro — f). (6) 


Again, Dymo will be zero for n = m and for all nodes n not connected 
to m. 

We will see that the final frequency depends on the center fre- 
quencies of the oscillators, on the A and B coefficients, and on the 





Fig. 5 — Interconnected nodes of electric analog. 
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initial buffer settings. If the initial buffer settings bamo are all zero, 
the final frequency depends only on the fno’s and the B,’s. If the buf- 
fers overflow, this in effect resets them. 

We can see from Fig. 5 that the system will be stable in the case 
of certain nonlinearities. 

For example, all the R,’s can be nonlinear as long as no resistance 
is ever negative at any current. Thus, if in the equations of the oscil- 
lator system the second term on the right of (2) is replaced by a 
nonlinear function of (f; — fio), the system will be stable as long as 
the term decreases monotonically with increasing fi. In the case 
C = 0, this corresponds to a nonlinear control of clock setting as a 
function of buffer content. 

It is easy to show that in a special case buffer overflow cannot 
result in instability. This is the case in which the buffer readings at 
two ends of a transmission circuit are complementary, that is, their 
sum is zero. Disregarding changes in transmission time, if the buffers 
are both set to zero or to complementary values at the same time, 
or if both buffers overflow and then recover, the readings will be 
complementary. 

Appendix A shows that for this case, in the electric analog of Figs. 
4 and 5 buffer overflow results in the dissipation of energy, and 
this convinces the writer that in this case overflowing buffers cannot 
make the network unstable. The writer is mortified that he is unable 
to demonstrate this for the case of non-complementary readings, but 
he suspects that buffer overflow will not result in instability in this 
case, either. 

Buffer overflow would affect the final frequency of operation. Fur- 
ther, all the buffers at a given node can conceivably overflow per- 
manently (if the clock center rate shifts drastically, for example). 
In such a case, the node will operate out of synchronism with the rest 
of the network. This will cause buffer overflows at nodes connected 
to the asynchronous node. Such overflows can affect the frequency of 
operation of the rest of the network, but need not prevent its syn- 
chronous operation. 

If widespread buffer overflow is avoided, adjustment according to 
equation (2) will result in stable operation. 


X. PARAMETERS AND TIME CONSTANTS 


Let us consider equation (2) with C = 0. By using (1), this can 
be written 
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hi — fio = (1/T,) Es Din : (7) 


T, = B,/A = L/R, . (8) 


We see that T; is a time constant. Equation (7) prescribes the de- 
parture of the clock frequency from center setting in terms of the 
sum of buffer readings bi, for the various transmission circuits ter- 
minating at node 1. How shall we choose the parameter T;? 

The smallest amount by which the sum of the buffer readings can 
change is unity. The frequencies f, and fi differ by a very small 
amount. Thus, the fractional change in frequency caused by unit 
change in the sum of the buffer readings can be written 1/T;f1. If we 
are to take full advantage of the stability of the clock at the node, 
we should make this smallest change small compared with the clock 
stability expressed as a fraction, which we call d;. Hence, we should 
choose 


Thi <es T, > 1/dfi . (9) 


If this is not so, changes in buffer readings will cause sudden changes 
in frequency larger than the changes which would occur if the clock 
were not adjusted. 

The buffer readings must, however, be able to change the frequency 
of the clock by several times its fractional stability d, if we are to 
be sure to bring all the clocks to the common intended frequency. 
Strictly, it might be possible to accomplish this if (1/71f1) bm > di/n, 
where b,, is the maximum buffer reading and n is the number of buf- 
fers. It would seem wise to choose b,, large enough so that this criterion 
is considerably exceeded. We might reasonably ask that 


bn/ Tif, > di, ba > Tifid: « (10) 

As an example, let us consider a case in which 
T, = 10/d:f, (11) 
Bin —_ 107, f,d, = 100. (12) 


Assume that f,; = 5 X 108. For various values of d,, the computed 
values of T; are: 


Fractional Stability 
oscillator (pulses per day) Time constant, 
stability, di 86,400 di T1 (seconds) 
~ 1078 4.32 X 105 2 
2x 10-8 864 1000 (17 minutes) 


10-2 - 43,2 20,000 (5.5 hours) 
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The time constants 7, are large, implying that the time for the system 
to come to equilibrium is large. The time constants may not seem 
excessive, however, when we consider oscillator stability measured in 
pulses per day. It would be rash to try to adjust an exceedingly stable 
oscillator in too short a time. Further, time jitter in the received 
pulses (see Appendix D) and perhaps other quickly changing phe- 
nomena could cause undesirable changes in operating frequency if 
T, were made smaller. 


XI. COPING WITH CHANGES IN TRANSMISSION TIME 


We have not yet considered the effect of changes in transmission 
time. Changes in buffer reading have been ascribed to differences in 
frequency. But changes in transmission time can also cause changes 
in buffer reading. 

Consider two interconnected nodes. If the clock at one speeds 
up, the buffer at the node will tend to empty and the buffer at the 
node to which the fast clock is connected will tend to fill. Thus, a 
change in clock rate will change the buffer readings at interconnected 
nodes in opposite sense. 

Consider an increase in transmission time between nodes, for ex- 
ample, an increase in transmission time in both directions. Because 
of the increase in transmission time, more pulses will be stored in 
the lines and the buffer readings will decrease at both ends. Thus, 
changes in transmission time will change the buffer readings at inter- 
connected nodes in the same senses. 

If we can compare the buffer readings at two interconnected nodes, 
we can distinguish changes caused by changes in transmission time 
from changes caused by changes in clock frequencies. Once we iden- 
tify a change in transmission time, we can correct for it by means 
of an adjustable delay between the transmission system and the 
buffer input. Such corrections have been provided in some synchroniza- 
tion schemes.® It is not clear, however, that an automatic system 
acting in the same way among all nodes is best in coping with 
changes in transmission time. 

Another course would be to use no adjustable delay, and merely 
provide buffers large enough to accommodate changes in transmis- 
sion time. Then, changes in transmission time would cause buffers 
to fill or empty. This would have some effect on system frequency. 
If the system included highly stable clocks, such changes would neces- 
sarily be very small. 
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What should be done about changes in transmission time depends 
on the magnitude of such changes and on how rapidly they occur. 
Unfortunately, adequate information is not available. 

In cable and waveguide systems, transmission time can vary with 
temperature and with the gas pressure within the waveguide or cable. 
What is known is discussed in Appendix B. For a circuit 3,000 miles 
long, we might expect variations of more than a thousand pulses over 
the year. However, because cable is buried and waveguide would 
be, we would expect the transmission time to vary little during the 
day, or over a period of several days. Experience with the L4 system 
tends to confirm this. 

If the short-term stability of transmission time of cable and wave- 
guide is as good as would seem, it would be satisfactory to make 
compensating adjustments in delay at intervals of days or weeks; this 
would argue for a scheme of adjustment separate from that used for 
clock synchronization. 

Variations in transmission time for microwave radio systems (see 
Appendix C) may be comparable to those for cable or waveguide sys- 
tems, but changes may be more rapid. Diurnal changes might be 
taken care of by the buffers, and slower changes by daily delay 
adjustments. 

A node may often be connected to the rest of the network by cable 
and/or waveguide circuits as well as by microwave circuits. In this 
case, it seems attractive to the writer to use the cable or waveguide 
circuits for adjusting clock rate. Then the buffers and delay adjust- 
ments associated with the microwave circuits could be used solely 
to compensate for changes in microwave transmission time. 

Thus, a somewhat mixed strategy of adjustment may be called 
for. The following seems reasonable to the writer: 


(2) If possible, avoid the use of microwave circuits for synchroniza- 
tion. Use parallel cable or waveguide circuits for synchronization, and 
use buffers or adjustments of delay to absorb changes in microwave 
transmission time. 

(i) The system will probably contain several master clocks which 
are more accurate than other subsidiary clocks. The parameters T 
will be chosen [according to (9) or (11)] so that frequency is chiefly 
dependent on the center frequencies of the master clocks. Hence, 
the correct function of the buffer readings at the subsidiary clocks 
is to adjust these clocks to the (correct) operating frequency. At 
each node with a subsidiary clock, periodic adjustments of delay 
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should be made, such as to make all buffer readings zero. At the same 
time a delay adjustment should be made to make any buffer at a 
master clock on a line from the subsidiary clock zero. Simultaneously, 
the subsidiary clock should be readjusted so that its center frequency 
fno 18 equal to the current operating frequency f,. This adjusts for 
changes in transmission time by making buffer readings at the ends 
of links to and from subsidiary clocks complementary (and equal to 
zero). It may be desirable to make these adjustments at a common 
time at all nodes concerned. Notice that in making the adjustments at a 
node, no knowledge of buffer readings or clock settings at other nodes 
is needed. Such adjustments might be made once a day or once a week. 

(wi) It is desirable that the network link all master clocks together 
by cable or waveguide, directly or indirectly, but with no intervening 
buffering and retiming by less accurate clocks. When this is so, it 
would seem desirable to make periodic adjustments of the delays in 
each transmission circuit connecting two master clocks, such as to 
render the buffer readings complementary at the two ends of the 
link. This compensates for changes in transmission time. It is unde- 
sirable to adjust the frequency at center setting, since we rely on the 
frequencies of the master clocks at center settings to determine the 
operating frequency. This adjustment of delays need be made only 
infrequently (once a day or once a week). To make it, we must know 
at each master clock the buffer readings at the other ends of the 
circuits connecting it to other master clocks. 

(iv) In adding a subsidiary clock to a network, it should be ad- 
justed so that its frequency at center setting is equal to the current 
operating frequency and the buffers at both ends of all circuits con- 
necting it to the network should be set to zero. If a master clock 
is added, the adjustment of frequency at center setting should be 
omitted. 


XII. CONCLUDING REMARKS 


The behavior of a simple scheme of synchronization was investigated 
and found to be stable in the absence of buffer overload if clock 
adjustments are sufficiently slow compared with transmission time. 

The criteria given for choice of parameters result in time constants 
very long in comparison with transmission time. 

A strategy of infrequent periodic adjustment (once a day or once 
a week) has been suggested. This can compensate for changes in 
transmission time and correct the frequencies at center setting of 
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subsidiary clocks. A strategy has been given for adding new clocks 
to the system. If these strategies are followed, and if parameters are 
chosen as prescribed, it seems likely that the buffers will not overload 
except in case of clock failure. Clock failure will cause loss of syn- 
chronization at a node. 

It has been shown that in a special case, buffer overload from 
other causes will not result in instability; it seems plausible that this 
is sO in general. 

It appears that there are no inherent obstacles to the synchroniza- 
tion of large digital networks. In the practical realization of such 
networks it would be desirable to have more information concerning 
the variation of transmission time with time, and on the availability 
of suitable components, including: 


(t) Adequate buffers which will accept pulses at one rate, emit 
them at another, and behave under overload in the fashion described 
earlier. 

(w) Adequate delay of the order of 10° pulses, to bring frames 
into coincidence. 
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APPENDIX A 


Buffer Overflow 


The purpose of this section is to consider the consequences of buffer 
overflow by studying the behavior of the electrical analog. 
Consider equation (1): 


bin = Dian + | fy — fi) a (1) 

Because of the finite content of the buffer, if for bin, = 0 the buffer 
is set half full at t = 0, we must have 

—b, < by <b, , | bin | < 6, (13) 


_ One satisfactory way to treat buffer overflow is to let Ding of (1) 
change during time intervals when the integral would otherwise cause 
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an increasing violation of (13). For example, suppose that the integral 
is increasing with time. When b;, reaches the value bm, bino starts to 
decrease and decreases so as to make bi, = b,, for as long as the in- 
tegral continues to increase. As soon as the integral starts to decrease, 
bino Temains constant (until another buffer overflow) at whatever 
value it had when the integral started to decrease, and b,, starts to 
decrease. A little thought shows that this results in just the behavior 
that an overflowing buffer of the type described in the paper would 
exhibit. 

In equations (3) and (4) and in Fig. 4 we see that the exact analog 
of bin is Lin given by 


i= ae + (/L) | “(V4 — Vi) a}: (14) 


In exploring the effect of buffer overload it is sufficient to consider 
a circuit element consisting of an inductor and two bias currents. 
For simplicity we will assume that these bias currents are equal and 
opposite, with magnitudes J,, as shown in Fig. 6. The input and output 
currents are thus equal and of magnitude J. The input and output 
voltages are V2 and V,. The current J;, through the inductance ZL is 


| oe ae oe ee (15) 


Assume the same buffer overload current at each end, of magnitude 
I,. Thus, the magnitude of the current J cannot become greater 
than In. 

Suppose that J = 0 at ¢ = 0 and the voltages are such as to in- 
crease the magnitude of J. How much energy have we put into the 
circuit by the time J = J,,? This energy EF, is 


Im 
Lin = (V2 i, Vit dt. (16) 
I=0 
Now 
_7,a,_ 7a _ 7a. 
Ve- Vi = LSaLEI +1) =L5 (17) 
Hence 
Im 
fet i: I dl = (1/2)LF, . (18) 
0 


Notice that H, does not depend on J, and that it is recoverable, 
that is, we get it all back if we change the current from J,, to 0 in 
such a manner that the magnitude of I is always less than I,,. 
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The voltage difference V2 — V, can cause the current I; to change 
even after I has reached its limiting magnitude I,,. I, must then 
change to keep the magnitude of I from exceeding [,,. In this regime 
of buffer overload, 


f= 1. (19) 
To Ig 
tet 
I I 
V2 _ vi 
IL 


Fig. 6 — Electric analog of buffer, used in studying buffer overflow. 


I,=I,+TIn. (20) 


What about the energy # that is supplied to the circuit during 
this period? This energy is 


B= : "“WeS WSLai 


ae Lor, dt 
oe LIL 12 —_ Tr) 
= LI,,(Lo2 ca I 41). (21) 


Overload operation persists only as long as J, is increasing. If 
we come to a point where J, would decrease, we hold J, constant. The 
buffer is no longer overloaded and we return to the regime of a fixed 
bias current. 

Thus, 


Loe > Loy (22) 
E> 0. (23) 


In fixed bias operation, the recoverable energy is merely Hi, as given 
by (18). The positive energy # has been dissipated. Hence, the effect 
of buffer overload is to dissipate energy, and buffer overload cannot 
result in instability. 
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APPENDIX B 


Cable and Waveguide 


The transmission time through cable or waveguide can change 
for several reasons. The gas pressure in a cable is controlled, and 
changes in gas pressure cause changes in transmission time, as could 
changes in gas temperature. While large changes in pressure could 
produce large effects (see Appendix C), it seems likely that another 
effect will dominate. 

This is the linear expansion of the cable or waveguide because of 
changes in temperature. Structurally, waveguide would probably be 
largely steel; cable might be considered as copper. The thermal coef- 
ficient of expansion of steel is about 12 x 10-6; for copper it is about 
18 X 10-*. For a change in temperature of 40°F or 22°C, the frac- 
tional change FC in length would be approximately 


Fractional change in length, FC, 


Material for 22°C change in temperature 
Steel 260 X 10-* 
Copper 400 X 107° 


During an experiment on the L-4 field installation in Dayton, Ohio, 
T. J. Pedersen observed phase shift versus time of a 12 MHz sine 
wave sent through an 84-mile loop of the L-4 system. During a 24- 
hour period he measured a peak-to-peak change of about 1.5°. The 
velocity of propagation is about 175,000 miles per second, so the 
total phase shift in the 84-mile loop was about 2 x 10° degrees. This 
is a fractional change of 


0.75 X 107°. 


During the time that this change took place, the temperature of 
the cable was changing about 0.3°F per day. If this temperature 
change was indeed the source of the phase shift, the fractional change 
in transmission time for a 40°F change in temperature would be 


FC = (0.75 X 107°)(40/0.8) = 100 X 107°. 


This value may not be accurate, but an estimate based on the thermal 
expansion of a metal may not be accurate either. 

Change in temperature increases the resistivity of copper, and 
this causes a change in the reactive as well as the resistive component 
of. skin impedance. Further, the capacitance may change with tem- 
perature. Both calculations and measured values of cable: parameters 
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as a function of temperature indicate that such effects will change 
transmission time much less than linear expansion. 

Change in diameter of a waveguide causes a change in group 
velocity. The change in transmission time with temperature which 
this would cause is considerably smaller than a change proportional 
to linear expansion. 

It may be conjectured whether cable or waveguide is or need to 
be free to expand in length as temperature changes. We have no 
waveguide systems in operation at present, and variations of trans- 
mission time for coaxial cable systems have not been adequately 
measured. We can only conclude: 

(2) Change in transmission time will be very slow, so that infre- 
quent adjustments would be satisfactory. 

(iw) Total fractional changes in transmission time may be from 
100-400 parts per million. 

For a path length of 3,000 miles, a velocity of 175,000 miles per 
second and a pulse rate of 5 xX 10° pulses per second, the number of 
pulses stored in the line would be 10’. For the fractional changes in 
transmission time quoted above, the changes in number of pulses 
stored in a 3,000 mile cable would be 


Change in number 


Fractional change in transmission time of pulses stored 

0.75 X 10-* (observed change in one day) 7.5 
100 X 10-* (estimated effect of 40°F 

change in temperature) 1000 
220 X 10-* (from expansion of steel caused by 

40°F change in temperature) 2200 
400 X 10-* (from expansion of copper caused by 

40°F change in temperature) 4000 
APPENDIX C 


Microwave Radio 


The velocity of radio waves traveling through the atmosphere de- 
pends on temperature, pressure, humidity and probably on rain. 

The effect of the first three factors is given in terms of N units. In 
terms of the index of refraction n (ratio of velocity in vacuo to 
velocity in the medium) 


N = (n — 1)10°. 


A simple expression gives N quite accurately™ 
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= 118 / “) 
N = “a |p + 4,810 5 


I 


Dp total pressure in millibars 
eé = partial pressure of water vapor in millibars 
T = absolute temperature °K = °C + 273. 


Here we are concerned with rough estimates of changes in N over 
short and long periods. 

The diurnal change in temperature may account for the most rapid 
fluctuations in N. For a constant pressure and disregarding water 
vapor, N is inversely proportional to absolute temperature. For a 
20°C (86°F) change in temperature around 20°C (68°F), at a pres- 
sure of one bar the change in N would be about 18. 

Data taken over a six-year period from forty-five U. S.° weather 
stations show that the monthly mean value of N at the earth’s sur- 
face (N;) varies from 230 to 400 over the country and through the 
year. In the U. S. the largest local variation in the monthly means 
is on the southeast and Gulf Coasts and amounts to a charge of 
N of 50 units. 

During a given month, the variation in N, (for one and ninety-nine 
percent probability) can be as much as 100 N units. 

Concerning rain, it has been calculated that 150mm per hour rain 
over a 1 km path will introduce a phase shift of some 500 degrees at 
30 GHz.¥2 This is equivalent to a change in N of 14 units. 

It is not easy to arrive at a reasonable estimate of short-term 
and long-term changes in the average value of N over a long trans- 
mission system. On the basis of the foregoing data, it appears to the 
writer that changes during the day would probably not exceed 20 
units, while changes during the month might be as much as 100 
units, and changes during the year might be several hundred units. 

If we assume a 3000 mile path, the nominal transmission time is 
about 0.016 second, and at a pulse rate of 5 x 10° pps the number 
of pulses in the transmission system will be 8 x 10°. The change in 
this number of pulses will be the number times N times 10-°. For 
some values of change in N, the change in number of pulses will be 


Change in number 


Change in N of pulses 
20 160 
100 800 


200 1600 
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Jitter Due to Regenerative Repeaters 


In an experimental digital repeater line? the systemic jitter intro- 
duced by a single repeater had an rms value 6, of about 3.3°. The 
rms jitter after NV repeaters, Oy, 1s given by** 


PW) 


Ov = On! BG) (24) 
_N_ QN—D)! 
POO 4XIN — 1 5) 


The function P(N) has been tabulated: P(1) = 0.250 and for 
N > 100, P(N) = N/2. Thus, for a large N > 100 number of 
repeaters, 


6, = 0,+/2N (26) 
Some computed values of 6, are: 
Number 62, rms phase jitier, 
of repeaters (degrees) 
1 3.3 
100 47 
300 81 
1000 148 
3000 256 
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Extension of Bode’s Constant Resistance 
Lattice Synthesis of Transfer 
Impedance Function* 


By S. Y. LEE 
(Manuscript received August 30, 1968) 


Bode developed some explicit formulas in terms of the poles and zeros 
of the transfer impedance function for each element of the first and second 
degree constant resistance lattice structures. To extend work in this area, 
we derive explicit formulas for two of Bode’s structures using coupled coils; 
we give two new structures which avoid coupled coils. Illustrative examples 
show the usage of these formulas. Finally, we include a general procedure 
for synthesizing any physically realizable, rational transfer impedance 
function by a constant resistance lattice network. A flow chart aids in 
detailing this procedure. 

With the addition of these results, a general method for synthesizing any 
physically realizable, rational transfer impedance function with explicit 
formulas is complete. The explicit formulas method developed in this paper 
gies more rapid results and introduces fewer round-off errors than the 
step-by-step procedures used in the past. 


I. INTRODUCTION 


An important characteristic of constant resistance lattice networks 
is the absence of reflection effects when such two-port lattice net- 
works are connected in tandem. The synthesis of a given transfer 
impedance function is simplified by representing the function by a 
partial product expansion. Thus the transfer impedance may be rep- 
resented by a tandem connection of a number of constant resistance 
structures (one for each partial product). This process will result in 


*Some results presented in this paper are based upon the author’s thesis, 
“Explicit Formulas for Constant Resistance Lattice Synthesis of Transfer Im- 
pedance,” presented to the Moore School of Electrical Engineering, University 
of Pennsylvania in December 1965 in partial fulfillment of the requirements for 
the degree of Master of Science in engineering. 
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realizable, simpler, transfer impedance functions provided that the 
constant multiplier of the given transfer impedance function is made 
large enough to permit each of the constituent networks to have non- 
negative loss on the real axis.t In general, then, there will be addi- 
tional fixed loss for the overall two-port network. 

For physical realizability, it is required that both members of any 
conjugate complex pair of zeros or poles be retained within a given 
partial product. Hence, each of the elementary constituent networks 
must be represented by a biquadratic factor. When there are single 
zero and single pole pairs on the o axis, the partial product factor 
for each pair is reduced to the bilinear form. It is recognized that 
the expansion can be performed with the zeros and the poles col- 
lected in a variety of ways and assigned to the individual networks. 
The elementary lattice networks for these bilinear and biquadratic 
factors are first and second degree constant resistance lattice struc- 
tures, respectively. Therefore, one can realize a complicated rational 
transfer impedance function, to within a constant loss, by a combina- 
tion of elementary structures in tandem. 

Bode? developed the basic first and second degree structures, which 
are given in Fig. 2. They cover all the possible pole-zero combina- 
tions. Furthermore, he derived the explicit formula for each element 
of structures I to VI in terms of the poles and the zeros of the transfer 
impedance function. The object of this paper is to extend work in 
this area. Explicit formulas are obtained for structures VII and VIII 
and for two additional structures (Fig. 7). The structures of Fig. 7 
avoid coupled coils. Illustrative examples are given to show the usage 
of these formulas. Finally, a general procedure for synthesizing any 
physically realizable, rational transfer impedance function by a con- 
stant resistance lattice network is included. This procedure is de- 
tailed with the aid of a flow chart. The appendix gives a method 
of obtaining the physical realizability conditions for one of the struc- 





Fig. 1 — General constant resistance lattice network. 
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tures as an illustration. Reference 2 supplies the derivations of the 
physical realizability conditions of other structures. 


II. DEVELOPMENT OF SECOND DEGREE CONSTANT RESISTANCE LATTICE 
STRUCTURE INTO GENERAL FORMULAS 


The transfer impedance function exp 6 of the constant resistance 
lattice given in Fig. 1 of second degree can be written as the biqua- 
dratic factor 

i, (s — a,)(s — a2) 
op 9 37, = be — by ” 
where exp @ is related to the series branch impedance 2, by the ex- 
pression 


2 [exp 6| — 1 _ p2 _ 
2S Teepe a) 1 for 2.2, = Ro = 1 (2) 
or 
= Less: 
exp 6 = hoe (3) 


Hence z, is a biquadratic of the form 


As’ + Ass + A, 


13 Age 4 Ae + Ad 4) 

The solution for A;’s can be expressed in terms of the zeros and the 

poles of the transfer impedance function exp 6 by setting (1) and (3) 
equal 


K sx’ — (a, + a)s + ad, 
’ —_ (b; + be)s +- b,b. 


_ (Ag + As)s* + (Aa + As)s + (Aa + A) (5) 
. (A, — A;)s” + (A, — As)s + (A, — A,) 


For convenience let 





a, = a + a, ’ A, = aa, (6) 
and 


By — b; + be ) Be = bb, ° (7) 


Then equating coefficients in (5) and solving for A,’s yields 


Ay =-Hlen(Ae+ As) —Bolde- Ad] - - Q) 





STRUCTURE 


GENERAL FORM 





(S-a,-ja2) (S-a,+jaa) 
(s-b,-jbz) (S-b, +jb2) 














REQUIREMENTS FOR TYPICAL ATTENUATION 


PHYSICAL REALIZABILITY | PHASE CHARACTERISTICS 


Ja] =|bl 


[a] = |b] 





Fig. 2— Constant resistance lattice equalizers. 
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GENERAL FORM 


STRUCTURE OF EXP 6 








K (S-a,) (S- ap) 
(s-b;) (s-b2) 





REQUIREMENTS FOR 
PHYSICAL REALIZABILITY 


TYPICAL ATTENUATION 
PHASE CHARACTERISTICS 











|a,ap| = |b; ba| 
b? +b,? << a?+a7 


THE bs ARE COMPLEX 


THE a/s MAY BE REAL 
OR COMPLEX 











kK (s-a,) (S-a) 
(s-b,) (S-ba) 





|a;a2| = |b; ba} 


spol 


{ { { 
ae ee ee 

=a2tse 

are as 


bebe 


THE b's ARE COMPLEX 


THE @S MAY BE REAL 
OR COMPLEX 
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GENERAL FORM 


STRUCTURE OF EXP@ 


kK (S-a,) (S- aa) 
(s-b,) (s-b2) 


K (s-a,) (S-aa) 
(s-b,) (s-b2) 





REQUIREMENTS FOR TYPICAL ATTENUATION 


PHYSICAL REALIZABILITY | PHASE CHARACTERISTICS 


| a Aa| = |b, ba| 
be +b? 2a?+aZ 





THE a's ARE COMPLEX 


THE b’s MAY BE REAL 
OR COMPLEX 


|a; Aa] = |b; bal 


{ { { 
— «> -—- + 

2 ~pn2 a 
bs af as 





{ 
—+ 
be 


THE a’s ARE COMPLEX 


THE DS ARE REAL 
OR COMPLEX 
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A, =a zla(As i As) ee Bol As pa As)] (9) 

A; = zlai(Asg =F As) = Bi(As — A;)] (10) 

Ay = z[ai(As te As) = Bi(As —_ As)] (11) 
A, +A; 

| Ace A, (12) 


Expressing exp 6 on the real frequency axis we have 


ay __ Ki(aa, — «”) — (a, + Ay) joo] 
op 8) = Tob, — a) (0, Baldo 
where 6 = a + j8 may be called the transfer loss and phase. From this 
the expression for the transfer loss is obtained as 


K*[(=o" + aide)” + (a, + a2)*o*) 


(13) 


= 2 2 22 1 
exp (2a) =a a bb.) we (b, ti bs) e ( 4) 
by letting 
k = K? and z=w (15) 
and from (6) and (7), equation (14) becomes 
Le gaa 2 
exp (2a) = Kos — x)" + ax (16) 


(6, — 2)” + Bie 
It can be shown that in general the attenuation characteristic of a 
lattice for which exp 6 is a biquadratic function exhibits a minimum 
at a real frequency.t One can shift this minimum loss to have zero 
loss at that particular frequency; thus the transfer impedance ob- 
tained will be within a constant loss. By doing this the attenuation 
characteristics of all elementary structures will have zero transfer 
loss at one frequency w». Corresponding to wo, exp (2a) = 1. Thus k 
can be determined in terms of the zeros and the poles of the transfer 
impedance function by (16). If Ag is equated to unity, As is obtained 
by (12). With the relationships (8) to (11) one can determine A,’s 
in terms of the zeros and the poles of the transfer impedance function. 
Hence from (4) and (2) z, and z, can be obtained respectively. 


III. EXPLICIT FORMULAS FOR STRUCTURE VII 


The physical realizability conditions and the typical attenuation 
characteristic of this structure are given in Fig. 2. Zero attenuation 
occurs at a frequency w); therefore we must choose k such that the 
attenuation becomes zero at the same frequency. 
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For zero attenuation, exp (2a) must be equal to 1; thus from (16) 
after rearranging and letting w2 = x, we obtain a quadratic in 2o 


(k ae 1)x5 a (Koes — 2a.k + 28, — B3)Xo =e (kas -* 63) = 0 (17) 
which can be written compactly as 


azz + br +e = 0 (18) 
where 
=k—-1 (19) 
b = (ai — 2a)k + 28, — Bi (20) 
c = kei — 62. (21) 


In order that the frequency be real and the attenuation equal to 
zero, the solution of (18) must have a double root at x. This condi- 


tion holds only when the discriminant b? — 4ac = 0. First we find 
from (18) 


—b 
= (22) 


Secondly we obtain the following quadratic in k 


to = 


Ak’ + 2Bk+C =0 (28) 
where 
A = ai(a} — 4a) (24) 
B = (aj — 2az)(2B2 — Bi) + 2(62 + a2) (25) 
C = Bi(6i — 462). (26) 


It can be shown that the larger root of (23) must be used to insure 
that z, is a positive real function.” Denote this larger real root by 
k.,. Thus, from (15) and (23) 


_ ae _ {-2B + [(2B) — acy 
Kem — Kn > ‘ 2A max 


Hence Km can be obtained quite easily and it is in terms of the zeros 
and the poles of exp @. 


(27) 


* Notice that km as well as the discriminant of (23) must be positive and. 
real because of the physical realizability conditions. By considering all the 
possible sign combinations for A, B and C, one of the positive roots of Km is 
greater than, or equal to, unity. 
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By substituting K,, into (12) and letting Ag = 1 we obtain 


K, — 1 
K,+1 
Notice that from (12) and (28), the root K,, must be greater or equal 
to unity in order for z, to be positive real functions. 

Using (8), (9), (10) and (11) we can determine A;, Az, A3 and Ag. 
Then multiplying each coefficient by K,, + 1 we get 


A, = 





(28) 


A, = aKn — Bo (29) 
Az = aK, + Bo (30) 
A; = Bi — Kana, (31) 
A, = —6, — K,a, (32) 
A, = K,—1 (33) 
A,=K, +1. (34) 


Thus the coefficients of z, and K,, are expressed implicitly in terms 
of the zeros and the poles of the transfer impedance function exp 6. 
Furthermore from (83) and (84), we must have the positive root 
K,, > 1 for A,’s to be positive. 

Before considering the realization of the biquadratic z, in (4) with 
its coefficients given from (29) to (34), we will show that z, is a 
minimum resistance function. 

Rewriting (3) as 


L+z2._ (L+R,) + 5X, 





ee hee, Xe ey 
where 
z,= R,+ jX., (36) 
the corresponding magnitude is 
1+ Rk,’ + xX? 
exp (2a) = ease (37) 
Since we require that 
exp (2a) = 1 at a (88) 


then #, must equal zero and hence z, must be a minimum resistance 
function. 
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Now we can determine the element values for structure VII by using 
the results given in Chapter 4 of Boghosian and Bedrosian, in that 
the element values of a Brune network were expressed explicitly in 
terms of the coefficients of a minimum resistance biquadratic im- 
pedance function.* Since we have shown that the biquadratic function 
in (4) is also minimum resistance, it is a simple matter to express 
the element values in terms of the zeros and the poles of exp 6. The 
case when the coefficients of z, satisfy the inequality 


[A;A2]* _ [A,Ao]? <0 (39) 


is suitable for structure VII. Thus the Brune network for the series 
arm Zz, of structure VII is shown in Fig. 3, where the equivalent-T 
is used instead of the transformer. Since 2,2, = 1, we have for the 
cross arm of these lattices 


Aes + As + Ap 





(40) 


4 


Ag + Aw + Ay 


peste (Km-1) 2 L pris (a; Km-Ai)Ral2 
> TR (Km+t) a ag! (1-Km) Ce 


_ a@2Km~- Be 
aoKmt Ba 








Km! Km -1 


w? = [ost ae) : taste 





ie { [(a;+a2) Km-(b, +bo) | R3 2 
: Og (1-Km) Ce 

Co (a, + az) Km + (b, + bz) 
2=- 





a; a2 Km- by ba 


_ 4,2 Km—b, be 
~ ayaa Km+by be 


Fig. 3 — Series arm z; for structure VII of Fig. 2. 
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i 1 
Pye Cn rg Rz (Km -1) ]2 
“we | (1-Km)Cs e “L (Km+1) 






{ 


Ls = => 
Cs Woe R a2Kmt Bz 
a2Km — Be 
é _—(a;Km- Bi) 
ae 
azKmt Be 






vee (etealate =a] 


2 
Km =] 





1 

_ 4 | (+82) Km +(by +b2) }2 

L,= cee 
Wo (1-Km) Co 

_ (a1 +a) Km ~ (b; + ba) 


Cz = 
8; Aa Km +d, Do 


_ 442 Km+b, Do 
_ a;a2 Km—D, ba 


Fig. 4 — Cross arm 2, for structure VII of Fig. 2. 


Then z, is given by the network in Fig. 4 where the values of K,,, 
the zeros and the poles are the same as those given in Fig. 3. 


IV. EXPLICIT FORMULAS FOR STRUCTURE VIII 


The physical realizability conditions and the typical attenuation 
characteristic of this structure are given in Fig. 2. It can be shown 
that general formulas in terms of the zeros and the poles of exp @ for 
K,, and the A,’s are the same as those for structure VII, with the 
exception that for structure VIII the coefficients of z, satisfy the fol- 
lowing inequality 

[A;Ao]* ome [A,A.]? > 0 (41) 


yielding a positive sign for the reactance jw.L, and a negative sign for 
the reactance jw .L3 of Fig. 3. Similarly, the cross arm z, of structure 
VIII yielding a negative sign for reactance jw,.L, and a positive sign for 
reactance jw ol, of Fig. 4. Thus the Brune network for the series arm z, 
and the cross arm z, of structure VIII is shown in Figs. 5 and 6 re- 
spectively. 
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G = -L, eB 


La 


Ca 


Fig. 5——Series arm 2. for structure VIII of Fig. 2. (Z’s, Cs, and Rs are same 
as for Fig. 3.) 


V. EQUIVALENT NETWORKS TO STRUCTURES VII AND VIII 


To avoid the need for coupled coils in the lattices developed pre- 
viously, we can introduce the Bott-Duffin impedance arms in struc- 
tures IX and X to obtain equivalent networks to structures VII and 
VIII respectively in Fig. 2.4 The series and cross arm of structure IX 
have the same configuration as the cross and series arm respectively 
of structure X. Hence only the series arm of each lattice is shown 
in Fig. 7. These new lattice structures necessarily have the same phys- 
ical realizability requirements and exhibit the same typical charac- 
teristics as sketched in Fig. 2 for structures VII and VIII. The 
element values for the general case of these lattice networks without 
mutual inductance are given in Table I. 


VI. EXAMPLE 


We now illustrate by an example the methods we have developed 
to obtain realization of constant resistance lattice networks. Let us 
find such a realization given the transfer impedance function 


K(s* + 2.2688° + 6.517s? + 3.302s + 4.905) 
s* + 25° + 4.77858" + 5.556s -+ 5.556 


In order to represent the given function as tandem lattices, we ob- 


exp @ = (42) 


Ls 


Cy 


Fig. 6— Cross arm zy for structure VIII of Fig. 2. (L’s, Cs and Rz are same 
as for Fig. 4.) 


STRUCTURE 


GENERAL FORM REQUIREMENTS FOR TYPICAL ATTENUATION 
OF EXP 6 | PHYSICAL REALIZABILITY | PHASE CHARACTERISTICS 














Zy (SERIES ARM) 


La 


Zy HAVE THE SAME CONFIGURATION 
AS Zy IN (X) 


Z (SERIES ARM) 


Zy HAVE THE SAME CONFIGURATION 
AS Zyx IN (IX) 














Ja, a_| =]b, bo] 
b? +b? z=at+al 


THE a’s ARE COMPLEX 


THE b’s MAY BE REAL 
OR COMPLEX 


K (S-a;) (S-a) 
(s-b,) (s-b2) THE a’s ARE COMPLEX 


THE D'S ARE REAL 
OR COMPLEX 





Fig. 7— Constant resistance lattice equalizers. 
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TABLE I(a)—-ELEMENT VALUES FOR STRUCTURE IX 








Zi Z, 
R R, ] 
L, = C Liza ra 
l 1 
eee Rees 22 
Us eKu + B'Ro + DE a 
i tees saree 
[(a,a.K, + b,b2)°(K., + 1)]? 

Gy = Se = % 
= aK mn — Bo _ a,a.K.,, _ bbe ri 
Na aK, + Be > a,a,K,, + 6,6, C3 = Ds 

— 7/3 oe 3 
es (Kn = 1)°(a,a,K m = b,b.)]} 
M 
Cy = aa Li =. a 
K,—1 
Ry = Rot ChH=-T, 
C = Es a. Bo) (Bx = Kea) |! R! = i 
: (—B, = K,.0:)(Kn = 1) : Ry 
a {(eital — bibs) [(b: + bs) — Klay + aa] 
[—(b, + b.) — K,,(a, + a2)](K,, + 1) 
where 


M = (8, — Knou)[(a2Kn + B2)(Kn + 1)} 
— (6, + Kon) [Kn — B2)(Kn — 1)}* 
= [(b: + bo) — Kula, + a2)][(Gia2K, + b1b2)(Kn + D} 
— [(b, + b.) + K,(a, + a2)][(ia2Kn — bib2)(Kn — 1} 
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TABLE I(b) ELEMENT VALUES FOR STRUCTURE X& 


Z, 


ae { (Kn — 1)(8, — Kon) \ 
: (2K ny ae B2)(—B, i Kn) 


(aja,.K, -+ b,b.)[— (0, + b,) = K,,(a + a2) | 


Li 
RR, 





eS 


Ly = R3kiCy 








Bias M 
: (ak, — Bo) "(Km 1]? 
= M = 
[(a,a2K nm — b,b2)*"(Kn — 1)]? 
R! = Oa K in 7. Bo en 102K mn in bb. 





aK in + Be ~ a,a2K,,, + b, be 
Lk = RiRIC} 


= (K,, ae 1)*(e2K mn + B.)]? 
: M 


(K+ 1)*(a,a.K,, + b,b.)]? 








where 


M = (By — Knots) (2K m + BK + »} 
— (8: + Knoy)[(a2K mn — B2)(Kn — 1} 
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Ly 
DL, = Ct 
\ 
Lz = C3 
1 
Cy = TL, 
3 
1 
Rez a R. 
3 
I, = C4 
1 
Cy = L’ 
1 
kh, = R? 
4 
1 
C, = LU 
2 


= [(b, + be) — Krai + ae) ][(GidekK, + b1b2)(Kn + 1} 
— [(b + bo) + K,(a, + a)\[(a,a.K,, — bb. (Kn — 1)}' 
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tain a partial product expansion of exp 6 wherein the factors are 
bilinear or biquadratic forms. In the present example, we find 


exp 0 = K(& + 28 + 5) (# + 0.2685 + 0.981) 


s + 2s+2 s’ + 2.778 a 


or 





= s +28 +5 ( + 0.2688 + 0.981) | 
Ss [«( ae D5 8) || x. s+ 2.778 (44) 


where K or K,Ke2 are constant multipliers to allow for any corre- 
sponding net increase in loss required by the overall network. 
Each biquadratic factor must be physically realizable if it is to be 
synthesized using one of the basic structures. For the first factor we 
get the following zeros for the polynomials 


a, = —1+ 72, a = —1 — 72, (45) 
b, = -—1+j, b = —1 —j. 
From these a’s and b’s we determine 
a, = —2, a, = 5, 
B= —2, B= 2, (46) 


a+a=-6, +h =0. 


Substituting into the physical realizability conditions of structures 
VII and IX, we find that these conditions are satisfied. The second 
factor in (44) has the following values 


a, = —0.268, a, = 0.981, 
B, = 0, By = 2.778, (47) 


1 
ay 


1 1 1 


With these values the second factor satisfies the requirements for 
structures VIII and X. Hence, the given transfer function (42) can 
be represented by two second degree lattices in tandem with or without 
mutual coupled coils. 

Using (27), Ki; and Kz of (44) become 


K, = Kin = 1.2895, (48) 
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KS Ke 7035; | (49) 
and the constant multiplier K is 


K = K,K, = 9.0716. (50) 


Thus by substituting (46) and (48) into the explicit formulas for 
structures VII and IX, and substituting (47) and (49) into the explicit 
formulas for structures VIII and X, the element values for each 
corresponding structure can be obtained. These element values are 
summarized in Tables II, III, IV and V. It should be apparent that 
another realization may be obtained for (42) by interchanging the 
numerators of the two biquadratic factors given in (48). Then the 
counterparts would have to be re-examined to see which basic struc- 
ture would be realizable. 


VII. GENERAL SYNTHESIS PROCEDURE 


The flow chart shown in Fig. 8 is a guide for the general synthesis 
procedure of any physical realizable, rational transfer impedance 
function exp 6. This flow chart can be summarized as follows: 


(i) Factor the given transfer impedance function into first and 
second degree functions with both members of any conjugate complex 
pair of zeros and poles retained in each given partial product. 

(iz) Synthesize all first degree functions by structure III or IV 
according to their physical realizability conditions. 

(iit) Examine the poles and zeros of these second degree functions to 
see whether they are real or complex; then use the appropriate group of 
structures indicated. If the poles and zeros are real, factor the second 
degree function into first degree functions. 

(iv) Examine the physical realizability conditions further to de- 
termine to which sub-group of structures the function belongs. 


TABLE [I—ELEMENT VALUES OF STRUCTURES VII For (43) 


22 (series arm) Zy (cross arm) 
In = —0.0658 In = 2.0178 
Ie = 0.1290 Ls = 1.93879 
Ce = 1.0296 Cs = 0.0685 
IT; = 0.1344 Le = —0.9876 
R; = 0.5265 Ri = 1.8994 
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TABLE ITJ—ELEMENT VALUES oF StRucTURE IX ror (43) 
(EQUIVALENT TO STRUCTURE VII) 





2z (series arm) zy (cross arm) 
C, = 0.4956 I3 = 0.2084 Cy’ = 0.1842 C;' = 0.2084 
Ie = 0.1342 R, = 0.1264 Ly’ = 2.0178 Ri = 7.91138 
R; = 0.5265 In = 0.0424 R;’ = 1.8994 Ly’ = 3.1319 
Cs = 1.5695 C, = 0.3193 3 = 0.6371 C,' = 0.0424 


(v) Connect the synthesized elementary structures in tandem. 
(vt) Raise the impedance level to the desired Ry . 


The realizability conditions of structures VII and IX are the same 
and also those for structure VIII and X. If one wishes to avoid hav- 
ing coupled coils, he should use structures IX and X. Since struc- 
tures IX and X are generally more complex, one may elect to use 
the structures VII and VIII to save on the number of elements in 
the final network. 


VIII. CONCLUSION 


In the field of classical network theory, Bode developed explicit 
formulas in terms of the poles and zeros of the transfer impedance 
function for synthesizing constant resistance lattice structures of the 
types I, II, III, IV, V, and VI. This paper has shown the detailed 
development and derivations of explicit formulas in terms of the poles 
and zeros of the transfer impedance function for synthesizing types 
VII and VIII, with coupled coils by Brune Method, and for types 
IX and X which are new types of structures that may be used to 
avoid having coupled coils by Bott-Duffin Procedure. With the addi- 
tion of these results, a general method for synthesizing any physically 
realizable, rational transfer impedance function with explicit for- 
mulas is complete. The explicit formulas method developed in this 


TABLE [V—ELEMENT VALUES OF StRuUcTURE VIII For (48) 


Zz (series arm) 2y (cross arm) 
Ly’ = 0.7896 Lyf = —1,3958 
Ig = 2.4106 Ls; = 5.6767 
Co = 0.4572 Cz; = 0.1948 
L;’ = —0.5949 LL, = 1.8564 
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TaBLE V—ELEMENT VALUES OF STRUCTURE X FOR (43) 
(EQUIVALENT TO StrucTURE VIII) 





2z (series arm) zy (cross arm) 
C,' = 2.4677 C3’ = 1.2658 C, = 1.2665 Cz; = 1.1481 
LD,’ = 0.7896 Rj = 0.7511 In = 2.4677 Ly = 2.7222 
R;' = 0.4260 Cy = 2.7222 R; = 2.3470 C, = 2.4691 
L;’ = 0.8710 LL, = 0.4050 L; = 1.2658 Ry, = 1.33814 


paper gives more rapid results and introduces fewer round-off errors 
than the step-by-step procedures used in the past. 
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APPENDIX 


Derivation of Physical Realizability Conditions 


We now develop the physical realizability conditions for second degree 
lattices in terms of the poles and the zeros of exp @. We shall find that 
the requirement for non-negative loss at real frequencies for such two- 
ports leads to both a product and a summation condition on the poles 
and zeros of the transfer impedance function. This analysis is carried 
out in terms of an example utilizing structures VII and IX. These 
structures exhibit zero loss at a finite frequency w) (see Figs. 2 and 7). 
To evaluate the constant multiplier for these structures we set 
exp (2a) = 1 at w. , and let x = wo . Then from (16), k becomes 


ae (82 =2 fa) Bike 


(a, — Da) ale aiLo 
where a’s and #’s are given by (6) and (7) respectively. 


Substituting (51) back into (16) and applying the non-negative loss 
condition, that is, exp (2a) 2 1 for all frequencies we have 


le =n) + aby IL, — 2) + ox] = * (52) 


(51) 
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Fig. 8—Flow chart for synthesizing transfer impedance functions with constant resistance lattice networks. 
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We shall obtain one of the realizability conditions by letting the 
frequency approach infinity. The result is the expression 


(82 — Lo)” + Bix 2 (a, — Xo)” + aio (53) 
Expanding this expression and substituting for a’s and f’s from 
(6) and (7), we obtain 


(byba)” + (bi + b3)to Z (aide)? + (at + a5)ao . (54) 
For 
| Q,Q2 | Pi | bbe | (55) 
then 
(a,a2)” 2 (bib2)”. (56) 
Thus (56) can be rewritten as 
(a,d2)” = (bib2)” + ¢ (57) 


where ¢« is a positive quantity. Substituting (57) into (54) we get 


(by b.)” + (b; + b3)2o = (b,b.)” +e+ (aj + a3) Lo . (58) 
Simplifying and dividing both sides by 29 yields 


b+ Ba +a ts (59) 


Since « and 2 are positive quantities their ratio may be deleted 
without altering the inequality of (59). Thus we have shown that the 
realizability requirements for second degree structures VII and IX 
are given by the pair of expressions 


bi + bs = at + a3 (60) 


and 


IV 


| ad_ | = | bide |. 
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Recirculating Ultrasonic Stores: 
An Economical Approach to 


Sequential Storage with Bit 
Rates Beyond 100 MHz 


By E. K. SITTIG and F. M. SMITS 
(Manuscript received September 30, 1968) 


State of the art integrated circuits and ultrasonic delay lines can be 
combined to form batch-fabricated digital storage modules having random 
access to sequentially stored blocks of information. Greatest economy 1s 
indicated if such stores are designed for as high a bit rate as is technologi- 
cally feasible, at present limited by the speed of available integrated circuitry. 
A store of optimum design will have a block size of approximately 1000 
bits which, for a bit rate of 100 MHz, gives a maximum latency time of 
approximately 10 microseconds. Such designs are realizable with zero 
temperature coefficient material. The stores can be used as main memories 
for small computers or as fast transfer stores shuttling information be- 
tween a slow external bulk memory and a very fast random access memory 
in large computers. 

A vartety of accessing modes permit these stores to operate over a large 
range of access rates without requiring large buffer stores. 


I. INTRODUCTION 


In a computer of conventional organization, a central processor 
communicates with a large array of randomly accessible storage loca- 
tions, each of which contains one word of a given number of bits. 
The assembly of these locations, the ‘random access memory,” typi- 
cally consists of one discrete element for each bit stored, which oc- 
cupies a fixed location in space. This approach is comparatively 
costly. At present, the cycle time for such a memory of megabit size 
is of the order of one microsecond. 

Since cost and size normally prohibit providing storage for more 
than a few million bits in this form, additional bulk memory is pro- 
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vided in which bits are stored in homogeneous media at lower cost 
and higher density. Since the bit locations in this bulk memory are 
basically defined by sequential scanning from a given addressable 
starting location, the information has to be stored or read out sequen- 
tially as a block. Therefore, once the desired block is addressed, a 
certain latency time passes until the information is available. This 
typically ranges from 10 to 100 milliseconds in mechanically scanned 
systems such as drums or disks and is even longer if heads have to 
be repositioned. If the information is stored on magnetic tape, this 
latency time may be several minutes. 

The present trend is to have shorter processor cycle times and 
multiple access facilities in evolving computers; increasing emphasis 
is put on the ability to transfer blocks of information quickly be- 
tween a bulk store and the random-access memory with which the 
processor interacts. In order to avoid a bottleneck in the throughput 
of information, transfer stores of lower capacity but shorter latency 
time are provided; these transfer stores can be loaded from a slower 
store without intervention of the central processor but can also transfer 
data on demand with minimum waiting time. Drum stores and random 
access memory blocks are often used for this application. 

It is the purpose of this paper to point out that ultrasonic delay 
line stores have been developed to the point where bit rates of 100 
MHz and higher have become feasible; therefore, stores with opera- 
tional properties similar to those of a drum can be built which have 
maximum latency times of about 10 microseconds. Organized in 
parallel tracks, these stores can transfer many giga bit per second. 
In contrast to magnetic storage, these stores share with semiconductor 
stores the disadvantage of volatility with respect to power failure; 
but they have the advantage of high storage density and absence 
of moving parts. 

These devices appear to be a strong contender for buffer stores of 
relatively short latency time. Since they are sequential stores with 
the information stored in a homogeneous medium, one may expect that 
the storage cost could be considerably lower than would be the case 
for a random access memory. This paper will show this to be the case. 
As a matter of fact, the higher the frequency of operation the more 
economical a delay line store becomes since its components become 
more and more compact. At 100 MHz bit rate, for example, stores 
with packing densities exceeding 6000 bits per cubic centimeter are 
readily possible. 

In optimizing a store, the interrelation between delay line and 
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auxiliary circuitry needs to be considered. Details of delay line de- 
sign analysis have been discussed elsewhere.t The present paper uses 
the delay line design analysis in an optimization with the auxiliary 
circuitry to find a combination which is optimized from functional 
and economical considerations. 

The most repetitive elements in a delay line store are individual 
recirculating delay line loops. Accordingly, the optimization of indi- 
vidual loops will be considered first. This will be followed by giving 
detailed design considerations for the delay lines meeting such re- 
quirements. Finally, operational characteristics of a delay line store 
will be covered. 


II. GENERAL TRADE-OFF CONSIDERATIONS FOR A SINGLE DELAY LINE LOOP 


A basic recirculating delay line loop typically has the configuration 
shown in Fig. 1. Binary coded data in the form of pulses appearing 
at terminal DI are inserted into the delay line through gates B and 
A when a “write” command pulse appears at terminal W. As the 
pulses appear at the other end of the delay line, they are amplified 
to make up for the insertion loss of the line and are detected in the 
amplifier-detector AD. In gate E the detected signals are retimed 
with respect to an external clock frequency inserted at terminal Cl. 
They can be monitored at the data output terminal DO, or gated 


— 





AMPLIFIER 
DETECTOR 


| 
| 
| 
| 


Fig. 1—Basic recirculating storage loop. The terminals are as designated: 
DI, data insertion; W, write command; R, read command; DQ:, DOs, data 
output; Cl, external clock frequency. 
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through D to the output terminal DO, by a “read” command applied 
at R. Gate C provides reconnection to the input of the delay line 
unless disabled by the write command at W while reading in new 
data. Storage loops of similar configuration have in the past been 
used with bit rates of a few MHz and storage capacities of up to 
about 20,000 bits in electronic desk calculators and the like. 

A large store will have to contain a multiplicity of identical storage 
loops, and a limited number of control circuits, registers, and clock 
frequency supplies which are common to all loops. To minimize the 
over-all cost of the system, it is most important to minimize the cost 
and complexity of the components constituting the individual loops. 
In principle a given storage capacity may be obtained by using many 
relatively simple delay lines with a corresponding number of regen- 
eration circuits or by using fewer but more complex delay lines usually 
requiring more complex circuitry. As long as the circuits have to be 
built from discrete components, it is more economical to use long and 
fairly complex delay lines so that maximum use could be made of the 
expensive circuitry. 

As an example of this approach, a store has been built with a capacity 
of 13 x 10° bits, using 48 delay lines, storing 28,000 bits each at a 
bit rate of 40 MHz, which gives a resulting latency time of approxi- 
mately 707 microseconds.? The materials available for such a large 
storage capacity per delay line exhibit a sizeable absorption loss and 
a temperature coefficient of delay around 80 ppm per centigrade de- 
gree. With such a large temperature coefficient some form of tem- 
perature stabilization is necessary. The high insertion loss of the delay 
lines—typically 50 dB pulse-to-pulse—requires amplifiers with care- 
fully controlled linear gain at the output of each delay line to bring 
the signal back up to logic level. To retime unavoidable drift in tem- 
perature between the individual delay lines, the regeneration circuitry, 
in addition, has to provide the largest retiming margin possible. 

These stringent requirements might be relaxed considerably by an 
alternative approach using delay lines which store only about 1000 
bits each, so that the individual lines can be of a simple rectangular 
block configuration; the rectangular block configuration, in contrast 
to the polygons required in the above-mentioned example, can readily 
be batch fabricated in large numbers. Thus a cost saving appears pos- 
sible in spite of the 28-fold increase in the number of delay lines over 
the example mentioned before. Also, the shorter delay line length per- 
mits the use of a delay medium, with higher absorption but with a 
lower temperature coefficient, so that temperature stabilization equip- 
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ment becomes unnecessary. Finally, with shorter individual delay 
lines a 28-fold reduction in the latency time is achieved which, if 
combined with lower cost for the devices, could make it more attrac- 
tive for computer applications. 

This approach will only be economical if the concomitant increase 
in the number of regeneration circuits can be obtained at minimal 
cost. This should be feasible if each circuit could be built as an in- 
dividual integrated circuit of reasonable size. In order to make this 
practical, certain requirements are posed on the delay line perform- 
ance. Since with an integrated circuit level detector pulses of 30 
millivolts amplitude or more can easily be detected, and since an 
integrated circuit driver can deliver readily pulses of the order of 1 
volt amplitude, the delay line pulse-to-pulse insertion loss should not 
exceed 30 dB. If kept to such levels, closely gain-controlled linear 
amplifiers can be avoided. Also, the transducers of the delay line 
should have an impedance falling into the range of 10 to 100 ohms so 
as to permit coupling the delay line to integrated circuitry without 
the use of transformers or tuning inductors. 


III. DESIGN CONSIDERATIONS FOR THE ULTRASONIC DELAY LINES 


The simple delay line to be considered has the configuration shown 
in Fig. 2; two equal piezoelectric transducers of thickness 4, and ac- 
tive diameter 2r are affixed to a delay medium in the shape of a bar 
of length x and square cross-section having a width D. The delay 
medium is characterized by its sound velocity cg, density pg and am- 
plitude absorption index » (absorption per wavelength). The trans- 
ducer material is characterized by its sound velocity c,, density pz, 
permittivity «, and electromechanical coupling factor k. At the fre- 
quency f,, for which the thickness of the transducer equals half a 
sound wave length, that is, for 


fo = ¢-/2t, (1) 


the electrical impedance Z; appearing at its electrical terminals is 


1 ; 2 
Z;= Rus (—j + 4k'/xz,), (2) 


where C, = ar? «/4, and 2 = paCa/pcCe is the acoustic impedance ratio 
of the delay medium with respect to the transducers. Without tuning 
networks, maximum power is transferred between a source of im- 
pedance R, and the delay line, and likewise between the line and a 
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Fig. 2— The basic delay line configuration. Two transducers of thickness J, and 
diameter 2r are attached to a delay medium of length x and lateral dimension 
D. This delay line is connected between a source of resistance R, and load R: 
without tuning networks. 


load resistance R, , if 
R, = BR, 1/eC, ; w, = rf, . (3) 


provided that 4k’/xz «< 1 which is fulfilled in all the cases of interest 
here. 

As shown in detail in Ref. 1, with these electrical terminations one 
obtains a reasonably linear phase response and a pass band centered 
near f, which rolls off approximately like sin* (zf/f,) on either side 
of f, if z is selected to fulfill the condition 


z=1-—F. (4) 


With this pass band, a unipolar input pulse of rectangular envelope 
and a nominal width 


T = 1/2f, (5) 


gives rise to an output pulse having the. shape shown in Fig. 3. The 
center lobe of this pulse is about 6.5 dB below the amplitude of the 
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input pulse. This reduction in pulse amplitude is due to side lobes 
being generated by the band limiting characteristic of the delay line. 

Pulses can be inserted at a maximum bit rate equaling f, with 
adequate margin for binary detection. Thus one bit can be stored for 
each sound wave length A, in the delay medium so that the storage 
capacity N is given by 


N = 2/da = cut/f, . (6) 


The transducer insertion loss would be minimized if k is chosen as 
large as possible. This is evident from (2) in that more of the input 
voltage is dropped across the resistive part of the input impedance. 
However, k = 0.6 is about the maximum available in transducer ma- 
terials with reasonable technological properties so that a transducer 
insertion loss minimum of a few dB seems unavoidable. An attempt 
to trade off bandwidth for a reduction of loss would, as a rule, im- 
pair the detection margin of the output signal, and would probably 
increase the pulse amplitude insertion loss due to the pass-band char- 
acteristic above the value of 6.5 dB mentioned before. 

Within these limitations one may now choose a transducer-delay 
medium combination by criteria such as a small temperature coefficient 
and adequate sound absorption. As shown in Ref. 1 one can combine 
the expressions for the length of the delay line z, the thickness of the 
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Fig. 3— Output signal from a delay line when rectangular pulses of duration 
1/2f. are applied to its input. The worst case for detection is obtained when 
such pulses are entered at the rate f./2 representing a binary 1-0-1 sequence 
as shown; since the outermost sidelobes of the nearest neighbors will fill in the 
“zero” slot between “one” pulses, this creates intersymbol interference. A spurious 
signal at a —20 dB level further reduces the detection margin to the one shown. 
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transducer 4,, its capacitance C,, and the length of its Fresnel zone 
2X, which determines its directivity, namely 


t=Nrn 4.=¢/2,: €C,= 4er/i0 @2“=ar/r, WH 
to obtain a compatibility condition 


N = m-0,C,-x/2x, (8) 


with a materials constant 
m = c,/41'ec? . (9) 


This condition implies that the beam spreading loss LZ, (which is ap- 
proximately 2/x, in dB if x/x, < 10) and the impedance levels of the 
delay lines are interrelated for a given material combination and storage 
capacity. 

The total delay path loss Lz is composed in part of the absorption 
in the delay medium L, and the beam spreading loss Ly, . 


Le = DL, + Dy © x/x, + wN (10) 


with » in dB per bit. In order to keep the total loss below 30 dB, the 
above loss should be restricted to about 20 dB, since in addition there 
are a few dB transducer loss and the 6.5 dB loss in pulse amplitude 
due to the band-pass characteristic of the delay line. 

The absorption loss Z, increases with increasing frequency and 
thus introduces by itself additional distortions. Since, the beam 
spreading loss L, decreases with increasing frequency, it is possible 
at least to first order, to compensate these two losses by choosing 
them approximately equal at the center of the pass band leading to 
the condition 


t/t, = uN. (11) 


By restricting the total loss to 20 dB, one is limited to x/z, S 10. To 
reduce the spurious signals, (primarily the triple travel signal due to 
multiple reflection between the transducers) to at least 20 dB below 
the main response, a2 minimum propagation loss L, of 10 dB is necessary. 
This requirement limits the design range to 


5 S x/x, = 10. (12) 


The lateral dimension D of the delay medium is determined by the 
requirement that the directional response of the transducers sup- 
presses glancing reflections from the side walls by at least 20 dB. 
This is assured if? 
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Dp 5/02 (13) 


The above relations contain all information necessary for a complete 
design of an optimized delay line. 

It is interesting to note that (7), (9), and (11) do not contain the 
frequency explicitly. It enters only in implicit form through the ab- 
sorption index » which for most suitable materials increases less than 
linearly with frequency. 

The other factor to consider is the potential cost of the delay line 
when compared to other methods of information storage. The tech- 
nology required in fabricating delay lines is very similar to the semi- 
conductor device technology and, as in that case, the variable giving 
a measure of the cost is the area that has to be precision finished, 
plated, and so on. For delay lines this is the area of the two end faces, 
each of which is given by 


D? = Nvix/z, , (14) 


a relation obtained by combining (6), (7) and (18). 

Thus, the area per bit to be finished, 2D°/N, is seen to decrease as 
1/f? . Moreover, (6) and (14) combined state that the delay line volume 
zD° decreases like 1/f? . Thus at high frequencies the materials cost 
can be expected to be negligible compared to the finishing cost. These 
relations imply that, for economical reasons, the delay line should be 
operated at as high a frequency as possible, in spite of the reduction in 
storage capacity with increasing frequency which is imposed by the 
loss limit. 

With present-day technology, delay lines have been made with 
storage capacities of approximately 1000 bits and pulse-to-pulse in- 
sertion loss in the vicinity of 30 dB with bit rates beyond 100 MHz.* 
There is no reason that this frequency could not be further increased. 
However, at present integrated circuitry with toggle rates much above 
100 MHz has barely become available commercially so that at present 
100 MHz is the highest frequency that can be considered from a practical 
point of view. 

Once the frequency f, , the material constant m, and the impedance 
1/a,C, have been chosen, (9) indicates that the storage capacity N 
can only be varied in proportion to z/xz, . This in combination with (14) 
implies that the finished area per bit 2D’/N increases with the storage 
capacity N. The permissible range of N is limited by the considerations 
leading to (12). As mentioned before, in the optimum case the beam 
spreading loss should equal the bulk loss, a condition which usually can 
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Fig. 4— For a delay line consisting of sodium-potassium niobate transducers 
attached to a Bausch & Lomb T-40 glass delay medium and f. = 100 MHz: 
the loss in the delay medium Ja, the latency time tz, the combined cndface area 
of the delay line, silicon chip area both per 1000 bits, and their sum. Outside the 
shaded region either the loss is too high or the triple travel suppression too low. 


be met only approximately since the conditions imposed by (4) and (9) 
on the materials data will also have to be considered. 

A useful compromise on all counts consists of a delay line using as a 
delay medium, a glass with nearly zero temperature coefficient of delay, 
such as Bausch & Lomb’s T40 glass, and using ceramic sodium potas- 
sium niobate transducers. The materials constants for this system are 
as follows:’'* T40 glass: cz = 2.58 millimeter per microsecond, z = 0.51, 
»=9X 10° X (f/fo)°’* dB at f, = 100 MHz; sodium potassium niobate 
ceramic: c, = 3.68 millimeter per microsecond, «  500e, , k = 0.6. 
This combination fulfills (4) closely enough to be usable at bit rates 
up to f, with adequate detection margins. Figure 4 shows the delay 
medium loss L, of (10), the maximum latency time ¢, , and the processed 
area per 1000 bit, 2D” of (14) as a function of the storage capacity N for 
f. = 100 MHz. Condition (12) in this case limits the value of N between 
560 and 1100 bits as is also indicated in Fig. 4. 

For N = 1000 the delay medium length is « = 25.8 millimeters 
and the lateral dimension D = 2.45 millimeters, so that the material 
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cost can be considered insignificant compared with the processing cost 
of the end faces. These, however, tend to be proportional to the end 
face area 2D?, but will certainly be less than the cost of an equal 
area of integrated circuit chips, in view of the less complex procedures 
involved in fabricating the delay lines. The regeneration circuit com- 
prises about 30 transistors and should, at the present state of the art, 
require about 1 square millimeters of silicon.’ One circuit of this area 
is required for each delay line loop. The processed area per bit de- 
creases, therefore, inversely with the number of bits stored in a single 
delay line as indicated in Fig. 4. As a result of this the sum of the 
processed area of the chip and the delay line itself has a minimum 
near N = 3800 bits. If the processing costs per unit area were equal 
for the delay line and the chip, this minimum would correspond to the 
cost minimum. If, as appears likely, the cost per unit area is lower 
for the delay line, this minimum shifts to a higher N, close to the 
values of N between 560 and 1100 bit, permitted by the spurious sig- 
nal supression and insertion loss limit. With a transducer impedance 
of 28.5 ohms the bulk loss in the delay medium at 100 MHz can be 
made equal to the beam spreading loss. This impedance will pose no 
difficulty with standard integrated circuitry. 

It appears, therefore, that some presently available materials have 
close to optimum properties for the design of delay line storage loops 
operating at a bit rate of 100 MHz and storing 1024 bits with a re- 
sulting latency time of 10.24 microseconds. If such delay lines were 
produced by batch techniques, their cost should be comparable to 
those of a few square millimeters of silicon integrated circuits. The 
storage density in these devices is approximately 6000 bits per cubic 
centimeter of volume. Individual storage loops with delay lines stor- 
ing 1024 bits have been built and operated at bit rates of 100 MHz.° 


IV. ORGANIZATION OF A DELAY LINE STORE 


Delay lines of the design described above are highly compact and 
could be built in modules of, for example, 18 delay lines built in a 
single glass plate of approximate dimensions 2 inches by 1 inch by 
0.1 inch. The transducers would be mounted on the 2 inch by 0.1 inch 
faces while the 2 inch by 1 inch faces would be available as the sub- 
strate for interconnections, and the integrated circuit chips perform- 
ing the recirculation, clock, and control functions. If one uses two 
of the 18 tracks for parity check and timing functions, then each 
module would store 16,384 bits. Such a module might constitute a 
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repetitive element of any larger store so that the latency time for 
any randomly accessible block of information need not exceed 10.24 
microseconds at a bit rate of f, = 100 MHz, regardless of the store 
size. 

It thus seems sufficient to discuss the organization of an individual 
module. The simplest organization would consist of providing random 
access by a 4-bit track selector to each track which stores a block of 
data words in a word and bit sequential organization. The start of 
each track would be delineated either by a 10-bit cyclic counter, 
counting off the clock frequency, or by reading signals from one or 
two additional delay lines serving as timing tracks. By adding an 
address register and a circuit, comparing its content with the counter, 
individual words in each track can be addressed individually. This 
type of organization, shown schematically in Fig. 5 using as elements 
the storage loops of Fig. 1, is natural for bit-serial processing with 
its economy of equipment so that it may well find application as 
main memory in small computers, where a cycle time of 10 micro- 
seconds is quite adequate. 

Faster data transfer at the cost of more equipment is obtained in 
the word serial, bit parallel organization shown in Fig. 6. There the 
bits of any one word are contained in parallel tracks, so that a com- 
plete word is accessed by comparing its address with the counter 
reading. Data input and output have to be provided by parallel regis- 
ters; transfer of words to the outside world can occur at the bit rate 
fy. The store, whatever its size, can be written or read completely 
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Fig. 5 — Bit-serial, word-serial organization of B storage loops as in Fig. 1. 
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Fig. 6 — Bit-parallel, word-serial organization of B storage loops. 




















within 10.24 microseconds. In this form the store would cooperate 
best with a random access memory of about 10 nanoseconds cycle 
time without excessive buffer pile-up. As will be discussed in the 
Section V, slower, nonsequential operating modes can provide a match 
with memories of any cycle time between 10 nanoseconds and 10 
microseconds. 

Finally, like any sequential memory the delay line store can be 
organized associatively as shown in Fig. 7. All words are compared 
with a word preselected in the content register during the 10.24 
microseconds it takes for all words to pass by the test location. If 
the comparison extends only over a part of the bit forming a word, 
these bits can serve as a pointer address for sorting sequences, and 
so on. In this version the store acts as an associative memory with a 
cycle time of 10.24 microseconds. 


VY. TIMING PROBLEMS 


In the operation of a delay line store, the timing requires careful 
attention since in contrast to a digital shift register the delay line has 
a characteristic recirculation time. This time is dominated by the 
delay time ¢g of the individual delay lines while the circuitry will 
contribute only a minor additional delay. A delay line designed for a 
characteristic frequency f, can be operated at a bit frequency f, up 
to the frequency f,. If the frequency is chosen below f,, it is important 
that the pulse width is nevertheless maintained at 1/2f,. Deviation to 
longer or shorter values causes the insertion loss to increase, since less 
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Fig. 7— Bit-parallel, word-serial organization of B storage loops with asso- 
ciative addressing. A coincidence signal is produced when the content register 
agrees with a stored word, as the latter passes through the comparator. 


of the spectral energy of the input pulse falls into the pass band of 
the delay line. Storage capacity N and bit rate f, are interrelated by 


N= fo-ta ) (15) 


so that within certain limits storage capacity can be traded off in 
order to synchronize the store with an external clock frequency deter- 
mining the bit rate. However, usually it will be more advantageous 
to operate the store asynchronously with respect to the outside world. 
One then avoids the problems in distributing frequencies of 100 MHz 
over an extended system with a predetermined phase. Moreover, one 
can then slave the store clock to a delay line used as a timing track.? 

As in any asynchronous organization, buffer registers must be pro- 
vided which temporarily store information entered at one transfer 
rate and withdrawn at a lower one. Information accumulates in the 
buffer at a rate equal to the difference between the two rates until 
the complete block is transferred. This determines the buffer capacity 
required. It is therefore of advantage to know how a delay line store 
can be accessed at rates nearly equal to those of the equpment it 
serves, so that the buffer capacity can be kept low. 

If bits are loaded every tg second into a delay line loop of capacity 
N, bit (N + 1) will coincide in time with the first bit loaded if 


Nt, = pts (16) 


where p 2 1 may be any positive integer. However, if p has common 
factors with N, bits earlier than the (NV + 1)-st one will already co- 
incide in time, so that such values of p should be avoided. Hor oo = 1024, 
p may thus assume all odd numbers. 
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Combining (16) and (15) one obtains 
fe = 1/te = fi/p (17) 


as the permissible loading rates. These generally cause the bits to be 


stored internally in a scrambled sequence, with the exception of the 
choices 


p=qN+1 g=1,2,38,°--, (18) 


which cause storage in the original sequence, and 
p= qN—1 qg= 1, 2,3,---, (19) 


which cause storage in time-inverted sequence. Equation (18) is the 
basis of the well known delay line time compression (DELTIC) sig- 
nal processing systems described in the literature in which digital 
data are written into storage at a slow rate fg but are read at the fast 
bit rate f, = (qN + 1)fg.° Also, all pairs of loading rates fi and f go 
given by 


fea/fe2 = p/(QN + p) (20) 
cause storage in the same, although internally scrambled, sequence 
so that the bit addresses can be set by sequential counting rather than 
address comparison. 

Series-parallel conversion can be utilized to keep the difference of 
transfer rates between stores at a minimum. For instance, certain 
core memories provide access to, say, 72 bits every cycle with a cycle 
time of 1 microsecond. Parallel to serial conversion would make use 
of a bit rate of 72 MHz quite feasible, which would increase even 
further if various checking bits were added to each 72-bit word. 


VI. CONCLUSIONS 


Sequential storage in recirculating loops, consisting of ultrasonic 
delay lines in combination with integrated regeneration circuitry of 
medium scale complexity, has reached a state of the art where bit 
rates can be obtained which are, at present, unattainable with large 
scale integrated circuit registers. At bit rates around 100 MHz and 
beyond such storage appears economically competitive with LSI im- 
plementations operating at much lower speeds. Such bit rates have 
already been demonstrated. 

With modules storing blocks of 1000 bits at 100 MHz bit rate, 
larger stores can be built with latency times of about 10 microseconds, 
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for possible use as main memory in small computers or as fast trans- 
fer stores shuttling information between a slow external bulk memory 
and a fast random access memory in large computers. 

In spite of the inherently fixed recirculation time of such a store, 
it should be adaptable to various processor and bulk store speeds 
without having to provide more than a few words of buffer storage by 
a judicious combination of measures such as clock frequency varia- 
tion, DELTIC modes, and series-parallel conversion. This opens the 
prospect of using only one or a few basic types of storage modules 
with a standardized delay time so that maximum advantage could 
be taken of the savings inherent in mass fabrication. 
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Queues Served in Cyclic Order 


By R. B. COOPER and G. MURRAY* 
(Manuscript received September 30, 1968) 


We study two models of a system of queues served in cyclic order by 
a single server. In each model, the ith queue is characterized by general 
service time distribution function H;(-) and Poisson input with param- 
eler \; . 

In the exhaustive service model, the server continues to serve a particular 
queue until the server becomes idle and there are no units waiting in that 
queue; at this time the server advances to and immediately starts service 
on the next nonempty queue in the cyclic order. 

The gating model differs from the exhaustive service model in that when 
the server advances to a nonempty queue, a gate closes behind the waiting 
units. Only those units waiting in front of the gate are served during this 
cycle, with the service of subsequent arrivals deferred to the next cycle. 

We find expressions for the mean number of units in a queue at the 
instant it starts service, the mean cycle time, and the Laplace-Stieltjes 
transform of the cycle time distribution function. 


I. INTRODUCTION 


We consider a system of queues served in cyclic order by a single 
server. The ith queue is characterized by general service time dis- 
tribution function H;(-) and Poisson input with parameter );. 

We study two variations of this model. In the first, called the 
exhaustive service model, the process begins with the arrival of a 
unit at some queue, say A, when the system is otherwise empty. The 
server begins on this unit immediately, and continues to serve queue 
A until for the first time the server becomes idle and there are no 
units waiting in queue A. The server then looks at the next queue 
in the cyclic order, queue A + 1, and serves those units, if any, that 
have accumulated during the serving period of queue A. The server 
continues to serve queue A + 1 until for the first time the server 
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becomes idle and there are no units waiting in queue A + 1. The 
process continues in this manner, with the queues being served in 
cyclic order, until for the first time the system becomes completely 
empty. The process is then re-initiated by the arrival of the next 
unit. No time is required to switch from one queue to the next. 

The second variation, called the gating model, differs from the first 
in the following way: When the server moves to a queue with at 
least one waiting unit, the server accepts only those units that were 
waiting when the server arrived, deferring service of all subsequent 
arriving units until the next cycle. That is, in the gating model, at the 
instant the server advances to a nonempty queue a gate closes behind 
the waiting units, and only those units waiting in front of the gate are 
served during that cycle. 

The exhaustive service model is analyzed in detail. We obtain the 
generating function for the joint probability distribution of the num- 
ber of units in each queue at an instant at which the server finishes 
serving any one of the queues. We then obtain expressions for the 
mean number of units in a queue at the instant it starts service, the 
mean cycle time, and, in a form suitable for numerical computation, 
the Laplace-Stieltjes transform of the cycle time distribution function. 

Finally, we note that the equations describing the gating model 
differ only trivially from those of the exhaustive service model, and 
that the same method of solution applies to each. 

Systems in which a single server is shared among several queues 
are common. For example, in the No. 1 Electronic Switching System 
the central control spends much of its time polling various hoppers 
and performing work requests that it finds in these hoppers. Similarly, 
in a time-shared computer system the users have access through tele- 
typewriters to a central computer which is shared among them. The 
cyclic queueing models studied here are of a type which may be useful 
in the analyses of these and similar problems. 

The exhaustive service model for the special case of two queues 
has been studied by L. Takacs,t B. Avi-Itzhak, W. L. Maxwell, and 
L. W. Miller,?? and M. F. Neuts and M. Yadin.* Avi-Itzhak eé al 
used an argument based on the properties of mean values, to obtain 
an expression for the mean waiting time suffered by a unit in either 
queue. Takacs, by a more direct argument, utilizing the Markov 
chain imbedded at the epochs of service completion, obtained the 
corresponding Laplace-Stieltjes transform and formulas for the wait- 
ing time moments assuming service in order of arrival. Neuts and 
Yadin obtained waiting time results for the transient case. 
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The more general two-queue model in which the time required to 
switch from one queue to the next has some arbitrary distribution 
function is also studied in Ref. 3, and has been investigated in addi- 
tion by M. Eisenberg® and J. 8. Sykes.6 M. A. Leibowitz™:* has stud- 
ied a multiqueue model similar to the gating model studied here. A 
nonprobabilistic approach to cyclic queueing problems has been used 
by J. B. Kruskal.® 

In the present paper we use the imbedded Markov chain approach 
but, as with Neuts and Yadin, our chain is imbedded at the instants 
at which the server completes serving a queue, rather than at the set 
of all instants of service completion used by Takacs. Whereas Takacs 
and Neuts and Yadin obtained waiting time results, our analysis yields 
cycle time results. The mathematical analyses characterizing the three 
approaches share some common ground, although the differences, espe- 
cially those arising from our consideration of an arbitrary number of 
queues, are significant. 

Also, in a recent nontechnical article on queues by Leibowitz,° the 
present problem is offered as a prime example of an important, dif- 
ficult, unsolved queueing problem. 


Ii, PRELIMINARIES 


In the analysis of the exhaustive service model, we take the number 
of queues to be N + 1 = 2. Units arrive at the zth queue according to 
the Poisson process with rate \, ; that is, the probability Q,(k; ¢) that 
k units arrive at the 7th queue in an interval of length ¢ is 


(A;t)" ; 
Q.(k; t) = k! exp (—),) (kK =0,1,2,:°:+ ;2=0,1,---,N). 





The length of time required to serve a unit from queue 7 has distribution 
function H;(-) with mean h; (¢ = 0,1, --- , N). 

In the analysis of the exhaustive service model, we shall use the 
concept of busy period, discussed at length by Takdcs"’. For the ordinary 
single-server queue, the busy period is defined as the length of time 
from the instant a unit enters a previously empty system until the 
next instant at which the system is completely empty. Both the distri- 
bution function of the busy period and its Laplace-Stieltjes transform 
are known explicitly for the 1//G/1 queue. In particular, the M/G/1 
queue with arrival rate \ and mean service time A has a busy period 
with mean b = h/(1 — dh) if Ah < Landb = o if Ahk 2 1. 

Consider now the M/G/1 queue with j waiting units; define the 
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j-busy period as the length of time from the instant at which service 
starts on the first of the 7 units until the next instant at which the 
system is completely empty. (When 7 = 1, the j-busy period and the 
busy period are identical.) Each of the j units, which together generate 
a j-busy period, individually generates a 1-busy period. Thus (as 
Takacs shows) the distribution function of the j-busy period is the 
j-fold convolution with itself of the distribution function of the 1-busy 
period. 

Denote by B;(-) the distribution function of a 1-busy period for 
queue 2, by 6;(-) its Laplace-Stieltjes transform, and by b; = h,/(1 — 
\,h;) its mean. Let B*¥‘(-) be the 7-fold convolution of B;(-) with itself, 
B*i(-) = B;(-). Then a j-busy period for the 7th queue has distribution 
function B*‘(-) and Laplace-Stieltjes transform (6;(-))’. 


HI. FORMULATION OF IMBEDDED MARKOV CHAIN STATE EQUATIONS FOR THE 
EXHAUSTIVE SERVICE MODEL 


There are N -+ 1 = 2 queues. Suppose that the system is idle and a 
unit arrives at some queue at epoch 7, . The server immediately com- 
mences service at that queue, and continues to serve units at that 
queue until the first instant 7, at which that queue becomes empty. 
If the system is not empty at 7, , the server advances to the next queue 
in the cyclic order. The server immediately commences work at this 
queue until the instant 7, at which this queue becomes empty (where 
tT, = 7, if the server finds the queue empty), and continues on in this 
manner until for the first time, 7, say, the server finishes serving a 
queue and there are no units waiting anywhere in the system. The 
process terminates at 7, and is reinitiated by the next arrival. 

Thus the process generates a set of points 7) , 71, °** » Tn » Where To 
is the arrival instant of a unit at some queue in the previously empty 
system, and 7, is the first instant at which the system becomes com- 
pletely empty again. The next arrival, at epoch 7. say, reinitiates the 
process, and a new set of points, 79 , 7 , *** , Ta’ Is generated. We call 
the points 7, , --+ , 7, (and 7, -*+ , T,) switch points. 

Note that 7. is not a switch point, whereas 7, is a switch point. 
Successive switch points may occur simultaneously in time, but are 
nevertheless considered distinct. Thus, with each switch point is as- 
sociated a queue, namely, that queue at which the server has just 
completed its visit. 

When the server finishes serving a queue and finds the system com- 
pletely empty, a switch point associated with that queue is recorded. 
The next switch point is recorded when the server leaves the queue at 
which the process is reinitiated, and is associated with that queue. 
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Let (z; n,, -++, my) denote the state of the system at an arbitrary 
switch point, where 2 is the index of the associated queue, and 7, is the 
number of units waiting in queue 7 + k(k = 1, --- , N). [For simplicity, 
no special notation will be used to denote arithmetic mod (N + 1).] 
Let the state (2; n, , +++ , my) have probability P;(n, , --- , nw); that is, 
P;(m, , +++ , ny) is the joint probability that at a switch point, the 
server has just completed a visit to queue 7 (¢ = 0, 1,---, NM) and n 
units are waiting in queue 7 + 1, n, units in queue 7 + 2, --- , and ny 
units in queue 7 + N. 

The state (7; mn, , --+ , my) can occur through the following exhaustive 
and mutually exclusive contingencies: 


(«) The server leaves queue 7 — 1 and finds j7 = 1 units waiting 
for service in queue 7, where it thus spends a length of time 
equal to a j7-busy period. 

(it) The server leaves queue 7 — 1 and finds 7 = 0 units waiting 
for service in queue 7, but at least one unit waiting for service 
somewhere else in the system, so that the server then 
“passes through” queue 7 in zero time. [That is, the state 
(7; ny , *** Ny-1 , 0) necessarily follows the state (¢ — 1; 0, 
M, »***, Ny-1) where at least one of the n,4~0(k=1,---, N—1).] 

(iit) The server leaves some queue and finds no units waiting any- 
where in the system. With probability \;/\ (A = Xo +--+ + Aw) 
the next arrival (which reinitiates the process) occurs at queue 2, 
where the server then spends a 1-busy period. 


These considerations lead directly to the imbedded (at the switch 
points) Markov chain probability state equations: 


Pm y oe , Nw) 
= DD ee Do Pia bi gA®* fhe) 
j=1 ki=0 kn-1=0 
o N-1 : : 
[TT Qisn(m = ben 5 DQrexn 5 1) dB CD 

N-1 

am P;-,(0, Mm,c** , Ny-1) (1 = i( >) d(ny) 
m=1 


+% FPO, +0) fT Qisnl tn 5) ABAD 


fon ft are f= Oto a] (1) 
0 if «+0 
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[Throughout the analysis, arithmetic mod (N + 1) in subscripts 
will not be specially denoted.] Assuming it exists, the distribution 
{Pi(nm1; +++ , my)} is uniquely determined by (1) and the normaliza- 
tion equation 


ese Pitt ee (2) 


t=0 ni=0 nn=0 
(Intuitively, one would expect a unique stationary distribution to 
exist when >>", Ach; < 1.) 
IV. FUNCTIONAL EQUATIONS FOR GENERATING FUNCTIONS 


We define the probability generating functions g;(11, -** , tw): 


g(t, +++ ty) = ye seal > Pim, +°* , Mya ++ tn 
ni=0 nmn=0 
(i =0,1,-:- ,™). (8) 
Substitution of (1) into (3) yields, after some rearrangement, 


gi(Xy yee , ty) 


i] 


=i Do Dy ee: ss Pyi(j, hy 00+, Byer)art +++ ya" 


7=1 ki=0 kn~1=0 


ie sare (1 2 i( 5 n,)) 


ns=0 nn=0 m=1 


-8(ny)Pi-1(0, my +7 * » Mw-s)Br es BY 
N oe N 
+ * > P,(0, --- , 0) it exp (—i > Nan(l — v.)} dB ;(t) 
k=0 0 m=1 


@@ =0,1,--- ,N). (4) 


The integrals on the right side of (4) are recognized as the Laplace- 
Stieltjes transform (@;(-))’ of the j-busy period distribution function 
with argument >." _. Asam(1 — 2%»). Hence (4) yields the set of simul- 
taneous functional equations 


N 
gilX1 pore , ty) = na(a( Neam(l = r.)) »%1,°"* aes) 
m=1 
N 


a Ma( > Niew(l — v.)) Dd P.(0, ~*~ , 0) 
— PQ, +++ ,0)- (i =0,1,--- ,). (5) 
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V. SOLUTION OF THE FUNCTIONAL EQUATIONS 


For notational convenience, we define the nesting operator % for any 
sequence of functions {f,(-)} for which it is meaningful: 


E fae) = fal---Calhfol@)))- ++). 


We shall denote by x the vector with components 2, , --: , ty , and 

by 0 the vector with all components zero. Both vectors and vector- 

valued functions will be denoted by boldface type, and square brackets 

will be used to enclose vector arguments of vector-valued functions. 

Finally, we will denote by ¢(v) the first component of a vector v. 
Define the vector functions 


Z.[t. ,-°°* , ty] = Ee Arem(l — r)) eee oe 


so that (5) can be rewritten - 


gi(X) = gi-1(Z,[x]) + * 4(Z.lx) X P,(0) — P,-,(0) 
(=0,1,--- ,M). (7) 


-sts)) 


— Pn) G=0,1,--, No =12-5). 8) 


mel 


Iterating v — 1 times on7in (7) we obtain 


yol 1 N yl 
g(x) = o( = Z,-.(81) Ae d dX P.(0) > | 


i hy 3 


In particular, when vy = N + 1 (8) can be written 


oi( = Ze-sfxl) — gis) = PO(L — 2 Yo r-n0 E Zi- ~i) 


k= 


o 


= 0, 


— 


»N) (9) 
where we have set 
PO) = X P,(0). (10) 


We shall now solve (9) by extending a method devised by M. F. 
Neuts”? for the solution of a related equation in one variable. 
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Define the iteration procedure 
> N 2 
ViP [x] = 2 Z,-,[Vi~” [x]] 
kad 


@=0,1,-°+,N3j= 1,2, °°: > Vi [x] = x). (11) 
Using (11) in (9) gives 
g:(V3” [x]) = g.(Vi7~” [x]) 
N m 
= PO(1 - > alk ic} Z.-s1V!-fxll)) 
a m=0 k=0 
C= 0, E24 N57 = 1, 2) 229): (12) 
Adding equations (12) for7 = 1,2,...,n yields 
n—-1 1 N m ; 
gi(V§” [x]) — g(x) = P(O) > (1 nt > end Z,-u1V-" fel)) 
@=0,1,°*:,Njn = 1, 2, ++). (13) 
Now let n — o in (13). We will show in the next section that 
lim V;" [x] = 1 (vw S1,°---,¢y $1;7=0,1,---,N) (14) 


NO 


where 1 = [1, 1, --- , 1], so that (18) becomes 


a(t) — ai) = PO) (1-2 Yo rv-n0 E Zul! teH)] 


(¢ =0,1,--- ,N). (15) 
Notice that 
2 9d) = 1 (16) 
and 
Dy g:(0) = P(0) (17) 


so that upon setting x = 0 in (15) and adding forz = 0,1, --- ,N 
we obtain 


Pe =(1+ 4.) | (18) 
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where 


Ai) = 3 (1 - ED vend E Zul!” te1))) 


7=0 m=0 
@=0,1,--+,N). (19) 


(We remark that P(0) ¥ 1 — >°%_, \,h; because the set of switch points 
is not an arbitrary subset of the set of all points at which units leave 
the server.) 

It remains to calculate g;(1). Physically, g;(1) is the probability that 
at the instant the server leaves some queue, that queue is queue 7. This 
event occurs if 


(z) the last time the server left a queue the system was empty, and 
the next arrival occurred at queue 2, or 
(iz) the last time the server left a queue the system was not empty, 
and the queue was queue 7 — 1. 
Event (2) has probability (\,;/\) P(0); event (7) has probability g;-,(1) — 
Ji-1(0). Hence 


9.2) =™ PO + Gill) -— 90) G=O1,+,N). 00) 


[Equation (20) can also be obtained directly from (7) with x = 1.] 
But the difference (g;_-,(1) — g:-1(0)) can be evaluated from (15) with 
x = 0. Hence 


9:1) =¥PO+POAAO G=01,°,N) BY 


so that (15) can be rewritten 


* + A,-1(0) — A,(x) 
=>? OO @@ =0,1,--: ,N). (22) 


1+ Y As) 


The quantities on the right side of (22) are completely specified; the 
set of simultaneous functional equations (5) has been solved in the 
sense that g;(x) may be calculated for any x S$ 1. 


VI. PROOF OF CONVERGENCE 
We wish to prove statement (14): 


lim V$” [x] = 1 (4, S1,°-- ,2y S1;7=0,1,--: ,N). 


n-7O 
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Note first that V‘"[x] is a vector whose (V + 1 — m)th element 
(m = 1,2,---,N) is 


of = Z,.[Vi” xl) : 
k=0 
Therefore, we need show only that 


lim a Z,-41V"” fl) a 


n-00 k=0 


@=0,1,°->,Njm=1,2,---,Nja S1,°-++,2rS 1. (23) 


From the definition (11) it is clear that the sequence {V$” [x]} is bounded 
asn— o forx S 1, and therefore the sequence {g,;(V‘$” [x])} is bounded 
asn— o forx < 1. Also, 


Os a( Z.-alVS"(xl)) Sl @>1,x8)). (24) 


We now turn our attention to equation (13). From (24) we see that the 
right side of (13) increases monotonically with n for x < 1, and there- 
fore the sequence {g;(V{”[x])} increases monotonically with n forx < 1. 
Thus the sequence {g,;(V{” [x])} is monotonically increasing and bounded 
for x S 1, and therefore has a limit. Hence the left side of (18) has a 
limit, which implies that the series of nonnegative terms on the right 
side of (13) converges. This in turn implies that 


lim? Y rad{ E ZalVM El) = 1 SD. 


N70 m=O 


Statements (24) and (25) together imply (23), completing the proof. 


VII. MEAN NUMBERS OF WAITING UNITS 


Denote by 7;(k) the mean number of units waiting in queue7 + k when 
the server leaves queuez (¢ = 0,1,---,N;k=0,1,---,N37,(0) = 0). 
For convenience, let g;(1)7;(k) = m;(k) and m;(1) = m;.Then )™_, m; 
is the mean number of waiting units found by the server in the next 
queue in cyclic order at a switch point, and >)", m:(k — 2) is the 
mean number of waiting units in queue k at a switch point. We shall 
evaluate 7;(k) (¢@ = 0,1,---,N;k = 1,2,---,N). 

We first note that 7,(k) is given by 


os 6] 
m,;(k) = ax, gi(a1 ; aE | ty) 


ryseemryel 


(§=0,1,---,N;b=1,2,---,N) (26) 
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and the mean 1-busy period b; = h,/(1 — Ahi) generated by a unit 
In queue 2 is given by 


@@=0,1,--- ,N). (27) 





d 
eS "ds B.(s) va 
Differentiating through (5) we obtain 


re) 

Saullo 
Sree 2 (> Ayen(l — )) 3 ” GJi- (8; ’ ane | tia) 
eed 


+& PO = B( dvvall — a)] 
x Ox, Bh Sones t+m m. 
@=0,1,-:-,N;k=1,2,---,N) (28) 


which upon setting x, = ... = wy = 1 gives the two-dimensional set 
of linear equations 


CAC Ree et eee * PO)Assub: + (1 — (N—1)m,.(k + 1) 


@=0,1,-+- ,Njk=1,2,---,N). (29) 
For each 7, (29) can be solved successively starting with k = N 
and working backward: 


q41 7+1 


m(N i ) = NieN-i PD Biai-yMi-y i NP(O)Ayv-; oS Ni+1-,Oia1—» 
y=1 y=] 


GOS 0)1,. «Ni S01, ey N =): (30) 
In particular, when 7 = N — 1 equation (30) can be written 
t+N i+N 
hi = Nisr D> dyarty NPO)Arer Do Avsidyas 
veitl y=tt+l 
@=0,1,---,N). (2) 


When. \;410;+177,; is added to both sides of the 7th equation of the set 
(81) we have after rearrangement 


t+N t+N 
mM; (1 ss Nea Ds a Az. er — \ 'P(0) > Neei0ee1 = 3 by 41M, 
veitl vat 


@=0,1,-::,). (32) 
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The sum on the right side of (32) is a constant independent of the 
value of the index 7. Hence 
t4+N 


mM; (1 + Aree ae \'P(0) Ss Ava 0s 41 


v=ittl 


i+i+N 
= = Mi4;(1 a bere rey ye een) heer rs \~'P(0) De Nv 410s 43 
pettjgt+l 
@=0,1,-°-,N;7=0,1,--: ,N). (33) 
Combining (33) and (31) yields 
re 1 P(g) 2 Pest (i =0,1,-:: ,N) (34) 


where we define p; = \,h; and p = >,%., p; . Note that for (34) to be 
meaningful we must have p < 1. The {77;(k)} can now be calculated 
from equations (34) and (80). 


VIII. LAPLACE-STIELTJES TRANSFORM OF CYCLE TIME DISTRIBUTION 
FUNCTION 


Consider the set of switch points associated with queue 7, and append 
to this set every switch point associated with queue z — 1 at which the 
server finds the system completely empty. Call the elements of this 
augmented set the record points associated with queue 7. 

We define the partial cycle time for queue z as the elapsed time be- 
tween a switch point associated with queue z — 1 and the temporally 
preceding record point associated with queue 7. Denote by G;(-) the 
distribution function of the partial cycle time for the 7th queue, and by 
4;(-) its Laplace-Stieltjes transform. 

Since queue 7 is necessarily empty at an associated record point, all of 
the units waiting for service in queue 7 at a switch point of queue z — 1 
must have arrived during the preceding partial cycle time. Let P;_,(7) 
be the conditional probability that 7 2 O units will be waiting for 
service in queue 2, given that a switch point associated with queue 7 — 1 
has just occurred. Then the distribution function G;(-) of the partial 
cycle time for the zth queue and the distribution {P;_,(j)} of the 
number of units that arrive (according to the Poisson process with rate 
\,) during the partial cycle time are related as follows: 





PGi [ : A exp (—,f) dG,() 


@=0,1,---,N;7=0,1,---). (35) 
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Notice also that the distribution {P;_1(j)} has probability generat- 
ing function 
= gi-1(, 1, eewES, 1) 


PO) aur ea GS-0, 414 , N). (36) 


Substitution of (385) into (36) yields, for the Laplace-Stieltjes transform 
4;(-) of the partial cycle time distribution function for queue 2, 


rN; =.§ 
a-( Xs ’ 1, ’ 1) 


i(s) = gy @=0,1,°:-,N). (87) 


We define the (full) cycle time for queue 7 as the partial cycle time 
plus the time required to serve those units, if any, waiting in queue 
t when the server finishes queue 7 — 1. (Notice that in order to be 
counted as a cycle for queue 7, a time interval must contain a partial 
cycle ending at a switch point at queue 7 — 1.) Denote by G;(-) the 
distribution function of the cycle time for the 7th queue, and by y;(-) 
its Laplace-Stieltjes transform. 

The cycle time distribution function G;(-) is related to the partial 
cycle time distribution function G;(-) as follows: 


t © rd; 7 ‘ ~ . 
G0 = [SD exp (0.9 BHU - 9 dG. 
BEC) = 170 = 0,122 5 N)e. . 38) 
Taking Laplace-Stieltjes transforms throughout (88) we obtain 
yi(s) = 4:0; +s — 8; (s)) Cat a Pee yA) (39) 


Hence we have for the Laplace-Stieltjes transform y;(s) of the cycle 
time distribution function for the 7th queue 


g(t) — 8 ’ 1, Pim ty xy 1) 
eS Gal) 


By differentiating through (40) we obtain for the mean cycle time 
t; the intuitively obvious result 





CSOT N20) 


i = (b; + Ny iia (a = 0, 1, ws , N). (41) 
IX. THE GATING MODEL 


Consider now a system of N = 1 eyclic queues described by the 
gating model of Section I. Define P;(n, , --- , ny) as the joint prob- 
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ability that at the instant the server leaves a queue, that queue is 
queue z (¢ = 0,1, --- , N — 1) and n, units are waiting in queuez + 1, 
MN, units in queue 7 + 2, --+ , and ny units are waiting in queue 7 (that 
is, Ny units arrived at queue 7 after the closing of the gate). Denote by 
H*(-) the j-fold convolution with itself of the service time distribution 
function H,(-). Then 


Pim ie ee Ny) 
-) ni By wo N-1 

= > aD a a P3-1(j, ki cs » Ky-1) TI] Qisn(tm — kn 38) 
7=1 ki=0 kn-1=0 0 m=1 


Qsanlrw 5) dHF(t) + Pi10,m , + sriy-)(1 = (> nn)) 


m=1 


-d(ny) + * > PAO, se> , 0) [ I] Qi+m(%m 5 t) dH;(t) 


¢@=0,1,---,N —1) (42) 
where Q; = Qi+y. 
Equation (42), for the N-queue gating model, is only trivially dif- 
ferent from (1), which describes the (N + 1) — queue exhaustive ser- 
vice model. The analogue of (5) is 


N 
gil(X1 oe ec , ty) = nan peer Fee, v.)) a ays) 


oe N N-1 
x nl Niel — 2) 2d P.(0, eA) 


— P;-,(0, +++ ,0) (@=0,1,-°--,N—1) (48) 
where »;(-) is the Laplace-Stieltjes transform of the distribution func- 
tion H;(-), and g;(a1,..., 2y) is now the generating function for the 
gating model state probabilities. The solution of (43) follows that 
given for (5), and a complete analysis may now be carried out in a 
manner similar to that employed for the exhaustive service model. 
(We remark in passing that the equations originally considered by 
Neuts are those of the gating model with N = 1.) 


xX. SUMMARY 


Two models of a system of queues served in cyclic order by a single 
server have been presented. One of these, the exhaustive service model, 
has been analyzed in detail. This model is described by the imbedded 
Markov chain probability state equations (1), from which a set 
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of functional equations (5) for the probability generating functions 
are derived. The functional equations are solved with the help of a 
generalization of an iteration procedure used by Neuts. The equa- 
tions (5) are then used to obtain explicit expressions for various 
mean values, such as the mean number of units found waiting by the 
server in the ith queue, given by equation (34), and the mean cycle 
time, given by equation (41). The Laplace-Stieltjes transform of the 
cycle time distribution function is given, in a form suitable for nu- 
merical computation (and hence numerical inversion), by equation 
(40). 

It is then shown that the gating model is described by state equations 
only trivially different from those of the exhaustive service model. It 
is now easy to adapt the methods and results of the detailed analysis of 
the exhaustive service model to a similar analysis of the gating model. 


It is noteworthy that all results are expressed directly in terms of the 
single state probability P(0) and the relevant generating functions, so 
that there is no need to evaluate the individual state probabilities. The 
calculations are thus reduced to the iteration algorithm, which may be 
suited to digital computer solution. 
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A Hybrid Coding Scheme for Discrete 


Memoryless Channels* 


By D. D. FALCONER 
(Manuscript received June 6, 1968) 


We consider a coding-decoding scheme which can permit reliable data 
communication at rates up to the capacity of a discrete memoryless channel, 
and which offers a reasonable trade off between performance and complexity. 
The new scheme embodies algebraic and sequential coding-decoding stages. 
Data is initially coded by an algebraic (Reed-Solomon) encoder into blocks 
of N symbols, each symbol represented by n binary digits. The N n-bit 
symbols in a block are transmitied separately and independently through 
N parallel subsystems, each consisting of a sequential coder, an inde- 
pendent discrete memoryless channel, and a sequential decoder in tandem. 
Those coded n-bit symbols which would require the most sequential de- 
coding computations are treated as erasures and decoded by a Reed- 
Solomon decoder. We show that the hybrid technique reduces the variability 
of the amount of sequential decoding computation. We also derive asymp- 
totic results for the probabilities of error and buffer overflow as functions 
of the system complexity. 


I. INTRODUCTION 


It is well known that the use of block coding and maximum-likeli- 
hood decoding permits transmission of information at rates up to the 
capacity of a discrete memoryless channel with an error probability 
which decreases exponentially with the code block length.1-* A discrete 
memoryless channel (DMC) may be an adequate model for some 
types of real one-way digital communication channels consisting of a 
transmission medium, transmitting and receiving equipment and modu- 
lation-demodulation scheme. An arbitrary DMC is assumed to have 


* This research is partly, based on a Ph.D. thesis, Department of Electrical 
Engineering, Massachusetts Institute of Technology, 1966, carried out at the 
Research Laboratory of Electronics, Massachusetts Institute of Technology, 
supported by the National Aeronautics and Space Administration (Grant NsG- 
334) and a Hughes Industrial Fellowship. 
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a P-symbol input alphabet and a Q-symbol output alphabet. During 
each channel use, an input symbol is selected and transmitted and 
an output symbol is received. Successive input-to-output transitions 
are random and statistically independent; the probability that the 
output is symbol j (j = 1, 2,..., Q), given that the input is symbol 
1 (t= 1,2,...,P), is qj. (Table I contains a list of the symbols 
used throughout this paper) 

Maximum-likelihood decoding, which is known to be optimum, 
involves the cross-correlation of a received block code word with all 
possible transmitted code words. The number of code words, and 
hence the required number of decoding operations, grows exponentially 
with the block length; this exponential growth in decoding complexity 
makes maximum-likelihood decoding impractical, even for moderate 
block lengths. There has thus been considerable incentive to find 
suitable classes of codes having nonoptimum decoding schemes, for 
which the complexity (reflecting the number of components and the 
number of decoding operations per unit of transmitted information) 
does not increase exponentially with the block length. 

A number of coding-decoding schemes have previously been pro- 
posed. Among the most widely known are: 


(1) Algebraic coding and decoding schemes.® ° 

(it) Elias’ iterated coding and decoding.’ 

(iit) Massey’s threshold decoding of convolutional codes.’ 
(iv) Gallager’s low density parity check codes.® 

(v) Sequential coding and decoding,.?°+? 


For some performance-versus-complexity criteria, one or more of 
these schemes may be well suited. However, lower bounds on the 
performance and complexity of these schemes show that none can 
yield an exponentially low error probability for a rate arbitrarily 
close to channel capacity without incurring exponentially growing 
complexity; Ziv, Pinsker, and Forney have proposed some more gen- 
eral coding-decoding schemes for use with discrete memoryless chan- 
nels.%-*6 The common feature of these schemes and of the earlier 
scheme of Elias is that they incorporate two or more separate stages 
of coding and decoding as Fig. 1 illustrates.? The “inner stage” is an 
arbitrary block coding-decoding scheme, generally using maximum- 
likelihood decoding, which has just enough complexity to guarantee 
a fairly low probability of decoding error. Then the chain consisting 
of the inner coder, DMC, and inner decoder constitutes another dis- 
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Fou 
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TaBLE I—List or SyMBOLS 





Definition 


Size of channel input alphabet 

Size of channel output alphabet 

Transition probability that output is 7 if input is 7 

Block length of RS code 

Number of information symbols per RS code word 

Dimensionless rate of RS code. R = K/N 

Minimum distance of RS code 

Number of erasures to be corrected per parallel block 

Maximum number of correctable errors per parallel block 

Number of channel symbols per tree branch 

Rate of sequential code in bits per channel use 

Computational cutoff rate 

Time interval for transmission of a single channel symbol 

Number of tree branches per serial block 

Number of redundant (known) branches per serial block 

Overall information rate in bits per channel use 

Defined by: S = Né — 1 

Probability of decoding error for one serial block 

Upper bound on P,(e) 

Constants, for a given sequential code 

Sequential decoding error exponent 

Probability of error for a super block 

= — xrtny — (1 — x) fn(1 — y) 

= — xtnx — (1 — x) fn(1 — 2) 

Overall block length 

Number of sequential decoding computations to decode the jth super 
oc 

Upper bound on the probability that c; exceeds x 

Pareto exponent 

= max(a, 1) 

Number of computation units to decode a given super block 

= ne"lo A. exp [H(d)/ad] 

= Néa/(Néa — 1) A, exp [H(8)/ad] 

Size of buffer allotted to each sequential decoder 

Probability that buffer overflows before first Z super blocks are decoded 

Queue size after 7th super block is decoded 

Number of new super blocks joining queue during the decoding of the 

ith super block 

Maximum number of computations each sequential decoder can do per 

received branch 

Number of computation units to decode the /th super block 

=l+e 

= pn/Ay 

Total decoder buffer storage 


OUTER INNER DMC INNER OUTER 
CODER CODER DECODER DECODER 


Fig. 1— Two-stage coding-decoding scheme. 
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crete channel with a low probability of error or erasure. Scrambling 
and descrambling may be necessary to make this new channel mem- 
oryless. The “outer” stage or stages embody available coding and 
decoding techniques with long block length, which drive the proba- 
bility of decoding error down to a negligibly small value with a 
relatively small degree of complexity. The overall block length is the 
product of the block lengths of the individual coding stages, and the 
overall information rate is the product of the individual rates. 

The overall block lengths for these schemes are much larger than 
those known to be necessary to achieve a given error probability with 
a given information rate. However, this penalty, which is reflected 
in increased coder complexity, may be compensated for by the more 
favorable tradeoff between performance and decoder complexity. 

These multistage schemes allow transmission at any information 
rate up to channel capacity with error probabilities which decrease 
exponentially with overall block length (or its square root in Ziv’s 
scheme) ; the total decoder complexity may be large but it increases 
only algebraically with the overall block length. Notice that if the 
inner stage uses maximum likelihood decoding in order to achieve 
a low error probability for a rate close to channel capacity, its com- 
plexity increases exponentially with its block length. Thus the com- 
plexity of the inner stage may well dominate the total complexity, 
for rates close to capacity. 

We propose yet another two-stage coding-decoding scheme, which 
we call a hybrid scheme and which is described in detail in Section 
II. The inner stage involves sequential coding-decoding, which is 
known to be capable of yielding exponentially small error proba- 
bility for any rate less than the channel capacity. The decoding effort 
required of the inner stage is actually alleviated by the use of the 
outer stage, which involves algebraic coding-decoding. Section III 
contains derivations of upper bounds on error probability, distribu- 
tion of decoding computation, average decoding computation and 
probability of buffer overflow for the hybrid scheme. These bounds 
display the asymptotic performance capabilities of the scheme. The 
bounds are not sufficiently tight to be useful in obtaining detailed 
performance parameters for actual systems, but must be supplemented 
by simulations. Section IV contains some simple calculations, based 
on a previous simulation, for the performance of a hybrid scheme. 
Before describing the new scheme, we briefly review some salient fea- 
tures of algebraic coding and of sequential coding. 
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1.1 Algebraic Coding and Decoding 

Any algebraic code has an underlying algebraic structure, upon 
which the coding and decoding algorithms are based.5 For a code 
with block length N, each code word consists of N symbols picked 
from a finite field. Thus the symbol alphabet size must be a prime 
or power of a prime. The channel is assumed to either change a symbol 
to a different symbol in the field with some probability p (thus mak- 
ing an error) or change it to a symbol not in the field with some 
probability q (thus making an erasure), or pass the symbol on un- 
changed with probability 1—p-—q. 

Algebraic codes may be put in systematic form; K of the N sym- 
bols in a code word are information symbols and the remaining N-K 
are check symbols. The ratio K/N is the dimensionless rate of the 
code. The required coder complexity is generally proportional to N. 

An important property of an algebraic code is its minimum dis- 
tance, d, which is the minimum number of symbols in which any two 
code words differ. Practical decoding algorithms are available for 
certain classes of algebraic codes with specified minimum distance 
properties. These decoding algorithms generally involve a finite num- 
ber of algebraic (finite field) operations, and guarantee the correc- 
tion of up to T errors and S erasures for any J and S such that 


27+8Ssd-1. (1) 
The best known algebraic block codes are the BCH codes, for 
which both the number of decoding operations per block and the 
number of components vary with N approximately as N log N and 
with T approximately as T' log N, as shown by Berlekamp.® A special 
case of BCH codes, involving roughly the same order of decoder 
complexity, is the class of Reed-Solomon (RS) Codes.17 18 A RS 
code can be defined with any rate R and block length N, provided 
that the size of the symbol alphabet exceeds N. It can be shown that 
a RS code’s minimum distance is the largest possible, given R and N, 
that is 


d = diaz = (1 ams R)N “1, (2) 
Reed-Solomon codes are useful where the size of the code’s symbol 
alphabet can be large. 


1.2 Sequential Coding and Decoding 
Sequential coding and decoding is applicable in principle to any 
DMC. Sequential coding is also known as tree coding.1°-!? Included 
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in the class of tree codes are the easily implemented convolutional 
codes.?° 

A sequential coder accepts a sequence of consecutive binary infor- 
mation digits and, for each, generates v channel input symbols. Cod- 
ing is sequential; each channel input symbol depends only on pre- 
vious binary input digits. 

Implicit in the structure of a sequential coder is a tree, as typified 
in Fig. 2 for v = 3. Each branch is labeled with v channel input 
symbols. A sequence of binary inputs to the coder is conceptually a 
sequence of directions which sequentially steer the coder along a path 
(called the correct path) starting at the origin of the tree. Successive 
branches along the correct path are transmitted over the DMC as 
v-tuples of channel symbols. The rate of the tree code in bits per 
channel use is r = 1/v. If a rate r = w/v is required, bits entering the 
coder would be grouped into u-tuples, and there would be 2" branches 
stemming from each node of the tree. 
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001 
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Fig. 2— Tree structure of a sequential code. 
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Sequential decoding is a form of probabilistic decoding, which is 
applicable to tree codes. It is termed “probalistic” because the general 
decoding procedure applies to any randomly selected tree code and 
because the decoder is guided to a final decision by probabilistic 
considerations rather than by a fixed sequence of algebraic operations. 
A sequential decoder implicitly contains a copy of the tree, and must 
hypothesize a path through the tree, starting at the origin, which 
with high probability is the correct path. 

The Fano sequential decoding algorithm is a specific sequential 
tree search procedure which is efficient, practical to implement, and 
is amenable to analysis.1!: 12 The decoder examines received branches 
successively, makes tentative hypotheses for the corresponding branches 
of the correct path, and advances along them through the tree, if 
their likelihood, measured by an appropriate “path metric,” appears 
high enough. If the current hypothesized path appears not sufficiently 
likely, the decoder retreats one branch and starts searching for a 
more likely path. Thus there is backward and forward searching 
through the tree, with a trend toward the right, as the decoder con- 
tinually extends and revises its estimate of the correct path. If the 
rate r is less than the capacity of the DMC, the Fano algorithm 
sequential decoder can be shown to eventually trace out the correct 
path with high probability. 

The number of branch examinations, or computations done by 
the decoding algorithm to advance one branch deeper into the tree 
is a random variable. Analysis and simulation have shown that its 
mean is bounded, independent of the coder complexity, only if the 
code rate r is less than a “computational cutoff rate,” Reomp, which is 
characteristic of the channel and is always less than the channel 
capacity. 

Since the rate of transmission and the decoder’s operating speed 
are fixed, a buffer must be provided at the decoder to store arriving 
branches which accumulate during periods of intensive tree searching. 
The buffer is necessarily of finite size, and hence may overflow if a 
span of received branches requires an unusually large amount of 
computations. Buffer overflow is catastrophic, since it is accompanied 
by loss of data and subsequent disruption of the decoding process. 
It is generally the most prevalent mode of failure in systems which 
use sequential decoding. 

Restarting the decoding process after an overflow occurs is gener- 
ally possible only if the sequence of transmitted channel symbols is 
divided into blocks which are coded and decoded independently. That 
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is, at regular intervals, the coder starts afresh at the tree origin and 
erases its memory of previous information bits. Then if an overflow 
occurs, decoding can resume at the beginning of the next block. 

It will be shown that the hybrid coding-decoding scheme described 
in the Section II reduces the severe variability in decoding effort that 
is characteristic of sequential decoding, and furthermore, that for 
any rate up to channel capacity, the probability of decoding failure 
(error or overflow) asymptotically decreases nearly exponentially 
with the total system’s complexity. 


II. DESCRIPTION OF CODER AND DECODER 


2.1 The Coder 


Figure 3 shows the structure of the hybrid coder. We assume that 
N parallel independent DMC’s are available, each of which is used 
for transmission once every + seconds. These N parallel channels 
could be created by time-multiplexing a single DMC which is used 
once every 7/N seconds. The input to each DMC is from a separate 
sequential coder. The code rate is r = 1/v bits per channel use. Every 
vr seconds each sequential coder accepts a binary input digit and 
generates v successive channel input symbols which, in accordance 
with the tree structure of the code, depend on present and past coder 
inputs. However, each coder’s memory of past input bits is erased 


SEQUENTIAL 
CODERS 


NR INDEPENDENT 
STREAMS 


INFORMATION BIT 


TO N INDEPENDENT DMC'‘S 





Fig. 3—Hybrid coder structure. 
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at nvr-second intervals. Thus, successive blocks of n inputs are coded 
independently into blocks of nv channel input symbols; such inde- 
pendently coded blocks are called serial blocks, and the correspond- 
ing blocks of n coder input digits are called n-symbols. 

If a coder input digit is to be decodable with a low error probability, 
it must affect a certain minimum number of subsequent channel input 
symbols. However since the coder’s memory of previous inputs is 
erased at the beginning of each serial block, the final coder input digits 
in any n-symbol can affect relatively few channel input symbols. The 
error probability is kept low by making the last m (m < n) digits of 
each n-symbol a fixed sequence known to the decoder.!? Then each a 
priort unknown coder input digit can affect at least mv channel input 
symbols. The last m coder input digits are redundant; the net informa- 
tion rate of each sequential coder is then (1—m/n/)v bits per channel 
use. In general, n is chosen to be much greater than m, so that the 
decrease in net rate resulting from the periodic “resynchronization” 
is acceptably small. 

The N serial blocks simultaneously coded and transmitted in paral- 
lel over the N DMC’s comprise a super block. The corresponding set 
of N n-symbols which enter the coders in parallel is called a parallel 
block. NR of the n-symbols in a parallel block are independent sub- 
blocks each consisting of n—m information bits followed by m known 
bits. The remaining N(1—R) n-symbols in a parallel block are parity 
check symbols generated from the information n-symbols by an alge- 
braic block coder operating on a field of 2” elements (that is, the coder 
operates on n-symbols rather than individual bits). Each n-symbol is 
made to enter its respective sequential coder serially, as a sequence 
of binary digits at vr-second intervals. 

A parallel block is thus a member of a block code with block length 
N and a 2"-symbol alphabet. The code’s dimensionless rate is R, and 
the number of words in the code is 2”%”, 

The overall information rate of the system is 


R’ = R(1 — m/n)/v bits per channel use. (3) 


Since each DMC is used once every r seconds, the overall information 
rate is NR’/r bits per second. A source producing information at this 
rate would determine which of the 2%” block code words would be 
generated in each nvr-second interval. 

For moderate-to-large parallel and serial block lengths (greater 
than, say 50) the most eligible available block code would be a Reed- 
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Solomon code, since the required alphabet size is generally large, and 
RS codes have the largest possible minimum distance for given rate 
and block length. The alphabet size must be a power of two and must 
exceed (N + 1). This imposes a constraint on n, 


= loge (N + 1). (4) 


Typically, m might be between 10 and 100, n might be 10 or 20 times 
m, and N might be between 10 and 1000. Forney’® has pointed out that 
if nm = n’I (n’ and I integers) and 2”. = N then a RS code of block 
length N on a field of 2” elements can be implemented more simply as 
I repetitions of a RS code of block length N on the subfield of 2” ele- 
ments. Use of this smaller field for algebraic operations makes for 
simpler implementation of the RS coder and decoder. Figure 4 shows the 
structure of a super block. 

The Reed-Solomon coder may be implemented with a number of 
components proportional to N. Each of the N sequential coders may be 
realized as a convolutional coder, constructed from at most n shift 
register stages. Thus, the overall coder complexity is proportional to nN. 


2.2 The Decoder 


Not surprisingly, a decoder appropriate to the two-stage coding 
scheme just described consists of sequential and algebraic stages, as 
illustrated in Fig. 5. The first stage consists of N parallel sequential 
decoders which simultaneously and independently utilize the Fano 
sequential decoding algorithm to decode serial blocks emerging in 
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Fig. 4— Code structure: (a) block of bits entering RS codes, (b) parallel 
block (output of RS coder), (c) super block (output of sequential, coders). 
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parallel from the N DMC’s. This stage might be implemented by a 
time-sharing technique, in which a single logic unit is allocated to one 
decoder after another in turn. The second stage is an algebraic de- 
coder for the RS code. 

During the decoding of a super block, all N sequential decoders 
attempt to decode their respective serial blocks into the original input 
n-symbols. In general, some serial blocks require more computations, 
and therefore more computing time, than others. After all but some 
fixed number S (S < N) of the N serial blocks have been sequentially 
decoded, the S sequential decoders still at work are halted, and then 
all sequential decoders are free to start work on the n-symbols of the 
following super block. 

Meanwhile the present super block is passed on to the RS decoder 
in the form of a parallel block consisting of N — S sequentially de- 
coded n-symbols and S undecoded n-symbols which are treated as 
erasures. If the RS code’s minimum distance is d, and no more than 
T of the sequentially decoded n-symbols contain errors, where 


2T+S=d-—-1, (5) 


then the RS decoder is guaranteed to decode the parallel block cor- 
rectly, using a fixed number of decoding computations that varies 
roughly as N log N and as T log N.® *° In this way, those S serial 
blocks which normally would be sequentially decoded last are es- 
sentially all corrected by the algebraic decoder as soon as the first 
(N — S) serial blocks have been sequentially decoded. Thus the 
algebraic decoder’s assistance should tend to curtail the very long 
decoding times which occasional serial blocks may require and should 
thereby reduce the chances for overflow of the sequential decoders’ 
buffers. 

From relation (2), governing the minimum distance of an RS code, 


2T + 8 = (1 — R)N; (6) 


the numbers of correctable errors and erasures are proportional to N, 
for fixed rate R. 

A hybrid scheme closely related to the one described here was de- 
scribed and analyzed in Ref. 19. In that scheme the sequence of chan- 
nel input symbols is not divided into independently coded serial 
blocks. Instead, once the sequential decoding algorithm advances a 
certain fixed number of branches beyond a given n-symbol, that n- 
symbol is considered irrevocably decoded, and thus is presented to the 
block decoder as a nonerased symbol in a parallel block. As in the 
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Fig. 5 — Hybrid decoder structure. 


scheme described here, n-symbols which would require excessive 
numbers of sequential decoding computations may be decoded by 
the Reed-Solomon decoder. The asymptotic bounds on computation 
statistics are essentially similar for both hybrid schemes. The scheme 
described here appears somewhat more practical to implement. Ref- 
erence 19 also describes a simulation of the earlier scheme in which 
there are ten parallel sequential coding-decoding systems, and the 
block code word rate is either 8/10 or 9/10. The outer stage was in- 
tended to correct erasures only. The tail of the observed distribution 
of sequential decoding computation behaved as predicted by the upper 
bound of Section 3.2; the frequency of very large peaks of computa- 
tion was considerably reduced. 


III. BOUNDS ON PERFORMANCE AND COMPLEXITY 


In deriving bounds on the probability of error, distribution of com- 
putation, average computation, and probability of buffer overflow, 
we assume arbitrarily that the RS decoder corrects T = N8/2 — 1 
errors and S = Né — 1 erasures per parallel block, where 0 < 8 < ¥%. 
Half the RS code’s minimum distance is then used to correct erasures 
and half to correct errors. The value of 8 is then fixed by (6); 


PR oS DOR 
b="o Toyn2= 9” (7) 
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and 8 is essentially independent of the block length N for large values 
of N. 
Arbitrarily set m/n = 8. Then the overall rate is 


R’ = rR — m/n) = 7r(1 — a(1 — 26+ 3) >r(1 — 4)(1 — 26). 


(8) 


It will turn out that the performance of the hybrid scheme depends 
on the distribution of computation and on the error probability for 
the Fano sequential decoding algorithm. Previously known upper 
bounds on these statistics are summarized in Appendix A. The bounds 
are on averages over ensembles of tree codes. Following an argument 
of Shannon, one can show that most tree codes picked at random 
satisfy all the bounds at least to within a small constant factor.t For 
example, suppose the ensemble averages of error probability and mean 
computation per decoded bit are upper bounded respectively by X 
and Y. Then at least 9/10 of all possible tree codes have error prob- 
abilities less than 10X, at least 9/10 have mean computations less 
than 10Y, and therefore at least 8/10 satisfy both of these bounds. 

The upper bounds on the error probability?® and on the distribu- 
tion of computation®® for rates r exceeding Ryomp are known to apply 
also to the ensemble of convolutional codes, for which the coder’s 
complexity is proportional to n. This extension to convolutional codes 
has not been analytically established for the distribution of computa- 
tion for rates below Reomp;?" 22 however, it seems a reasonable con- 
jecture that the degradation in performance due to the implementa- 
tion of a tree code by a convolutional code is small for all rates. 


3.1 Error Probability 

From a result of Yudkin, it is inferred in Appendix A that the prob- 
ability p,(e) that a sequential decoder decodes a serial block incor- 
rectly is bounded by a negative exponential function of m, the number 
of redundant coder input bits in each n-symbol.?° With m = né, 


pule) < pi(e) = nA, exp [—ndvE.(r)] (9) 


where A, is a constant and #,(r) is a function of the tree code rate r 
and of the transition probabilities of the DMC. The exponent E,(r) 
is positive for any rate less than the capacity of the DMC. It is 
sketched for a typical DMC in Fig. 6. The probability of error p(e) 
for the hybrid decoder is the probability that N8/2 or more undetected 
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Fig. 6 — Sequential code error exponent Z.(r) for a typical DMC. 


serial block errors occur within a parallel block. Thus 


pe) = 2 LY pote mule) (10) 


=N6/2 


The asymptotically tight Chernoff bound for the distribution of 
sums of binomially distributed random variables may be applied to 
the right-hand side of (10).?° 


pe) S exp (—N{T.[8/2, p.@)] — H(8/2)}) OS pe) < 6/2 (11) 
where 
T(z, y) = —ximy—-A-—a2md—y) 
H(z) = —ct#&rex—-U—2&( — 2). 
It can readily be shown that for y < x < \%, 


T.(z, y) — H(x) > 0. (12) 
Thus the bound decreases exponentially with NV. Notice that 


ap ae T.[8/2,p.@)] <0 pe) < 4/2. (13) 


Thus, the exponent in (11) is monotone decreasing in p,(e), provided 
that p,(e) < 6/2; therefore p(e) can be further upper bounded by 
substituting p/(e) for p,(e) in (11) 

pe) < exp (—N{T.[8/2, pile)] — H(6/2)}) pie) < 8/2. — (14) 


The exponent in (14) will be positive if p/(e) < 6/2 < 4. By virtue of 
(9), this will be true if 


> DE Prats (2nA,/6). (15) 
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Thus p(e) decreases exponentially with N if (15) is satisfied. But 


1—R 3 





6>0 if R<1l 
and 


E,(r) > 0 if r < channel capacity. 


Thus, values of 7, 8, and m can be found for which the constraint (15) 
is satisfied, while the overall rate, given by (8), is arbitrarily close 
to the channel capacity; that is, § arbitrarily close to zero and r 
arbitrarily close to capacity. 

The overall block length is S, = nN. The serial block length n is 
constrained by (15) and by the constraint on the alphabet size of 
an RS code: 


n 2 loge (NV + 1). (4) 


Thus for fixed overall rate R’, and very large values of N, n behaves 
essentially as log, N, or at most as log, S,. This implies that for a fixed 
rate less than the channel capacity, the probability of error is bounded by 
a quantity that asymptotically decreases almost exponentially (approxi- 
mately as S,/log. S,) with overall block length S, . Notice also that the 
quantity S, is proportional to the complexity of the hybrid coder, if 
the tree codes are convolutional codes. As mentioned earlier, it seems a 
reasonable assumption that the bounds on error probability and distri- 
bution of computation apply to convolutional codes of any rate. 

The choice of 7 = N6/2 — 1 was arbitrary but convenient. For 
practical systems where WN is less than, say 50, it would undoubtedly 
be more efficient to make m large enough that p/(e) is negligible and to 
use the RS decoder to correct only erasures, that is, set 7 = 0 and 
S = NI — R). 


3.2 Distribution of Computation 

A sequential decoding computation is done every time a tree branch 
is examined and compared to a received branch. Let c be the total 
number of computations to decode a given serial block, that are done 
by a sequential decoder operating alone, without aid or relief from 
an algebraic decoder. Appendix A uses the results of References 19, 
21, 22 and 28 to show that the probability distribution function of 
c is bounded by a function which asymptotically is a pareto distribu- 
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tion. That is, 
pr (c = X) < [n*'/"A,/X]* (16) 


where a’ = max {1, a}, A, is a constant, and a is the pareto exponent, 
a function of tree code rate r and of the channel statistics. The pareto 
exponent is positive for all rates less than channel capacity, and is 
greater than unity for all rates less than Ryomp , Which is less than channel 
capacity. The pareto exponent is sketched as a function of r for a 
typical DMC in Fig. 7. Note that the average of c is finite if and only 
if a is greater than one. It is clear that the smaller a is, the slower is the 
asymptotic decrease in pr (c = x), and hence the greater is the varia- 
bility of the random variable c. The bound on pr (¢ = X) will be used to 
upper bound the distribution of the number of computations done by 
the hybrid decoder in decoding a super block. 

For analytical convenience it will be assumed that sequential de- 
coding of any serial block within a super block does not start until: 


(1) The preceding super block has been decoded. 
(12) The entire serial block has been received and stored in the 
sequential decoder’s buffer. 


These conditions ensure that successive super blocks are decoded 
independently, and that during the decoding of any super block there 
is no idle time spent by the sequential decoders waiting for new 
branches to arrive. These assumptions can only delay the operation 
of the sequential decoders in our model, and hence lead to a conserva- 
tive estimate of the buffer overflow probability. 

Decoding of a super block is essentially completed when all but 
S of its N serial blocks have been sequentially decoded. The number 
of decoding operations then done by the RS decoder is bounded by 
a fixed quantity, and will be neglected. Accordingly we define C, the 
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Fig. 7— Pareto exponent a for a typical DMC. 
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number of computation units to decode the super block to be the 
(S + 1)th largest of {c1, co,..., Cn}, where c; is the number of com- 
putations that the jth sequential decoder, acting alone, would require 
to decode the jth serial block. Then, no more than C computations 
are done by any one sequential decoder during the decoding of the 
super block. One computation unit represents one or more (up to NV) 
sequential decoding computations done simultaneously by the corre- 
sponding number of sequential decoders. 

The number of computation units C exceeds X if (S + 1) or more 
of {c1, C2, ..., Cy} exceed X. From (16), 


pr (c; = X) S p. = [n*"*A,/X]*. (17) 
Then analogous to (14) we have, forS + 1=N 8, 


pr (C = X) S exp {—N[T.(6, p.) — H(6)]} pz < 6. (18) 


A cruder but simpler bound is obtained by bounding 7',(8, p,) by 
—6§ ln py. Thus for pz < 8 


pr (C > X) S exp [NH(8)]pr° (19) 
= [A,/xy"* 


where 
A, = exp [H(8)/ad|n*’’"A, . 


From the definitions of A, , and H(6), and expression (17) for p,, it is 
easy to show that the condition p, < 6 is certainly true if X > A,. 
Also, pr (C 2 X), being a probability, is certainly bounded by unity. 
Thus 
[A,/xX]"" X> A, 
1 XSA, 
Notice that the right-hand side of (20) asymptotically has the 
form of a pareto probability distribution, but that the effective pareto 


exponent is N& times the pareto exponent for pure sequential de- 
coding. Now, 


pr(C2xX)s | (20) 


1—f 3 
ie an a 


6>0 if O<RkR<1 


(7) 


and 


a>0dO if r < channel capacity. 
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As the overall rate approaches channel capacity, « and 6 both ap- 
proach zero and the ‘break point’? A, grows very large [A, also in- 
creases as the 1/ath power of log, (N + 1) for very large values of N]. 
However, for arbitrarily small but fixed values of a and 6, the RS code’s 
block length N may be chosen sufficiently large that the effective pareto 
exponent Noe can be arbitrarily large and hence pr (C = X) arbitrarily 
small, for any X greater than A, . 

Tor a fixed value of N, the upper bound (20) is interesting only for 
X > A, or for values of a and 6 large enough that Néa>> 1. For values 
of X for which (20) is not tight, the probability pr (C 2 X) is upper 
bounded by the probability that the largest of {c, , co, --+ , ¢y} exceeds 
X; that is, it is bounded by N pr (C = X) where pr (C = X) is bounded 
in (16). 


3.3 Average Computation 


Presumably, the average number of computation units done per 
super block is bounded if Néa > 1, even if 0 < a < 1. This is true, as 
will now be shown. The average of C is written 


(Che = YX pr (C = X) 


ll 


> Xpr(C =z XY) —pCzX4+D) (21) 


ive] 


prices), 


Xe=l1 


I 


Then by (20) 


oo 


(Chew S Aa + ~> (Ai/X)"*, 


=Aitl 


The sum can be bounded by an integral from A, to infinity, since the 
integrand is positive and monotone decreasing. 


(ae SA + f (AY/xy" ax 


_ Néad, 
~ Nba — 1 


N6 alfa 
= Teme [exp [H(6)/ad]n ‘ As 


<o if Néa>l (22) 


Thus the average number of computation units per super block is 
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bounded if the effective pareto exponent Néa exceeds unity for any 
overall rate that is arbitrarily close to capacity, if NV is chosen sufficiently 
large. 

The bound on (C),, varies with n as n*’’*. Note that the number of 
computation units Cis a bound on the number of computations done by 
each of the N sequential decoders, and that the number of information 
bits decoded by each sequential decoder per super block is no more than 
n. Thus the average number of sequential decoding computations per 
information bit is bounded by 


CO. = Aw Nia > 1 (23) 
where 
Néa 
A, = WR, — 1 LexP [A (4)/e8) AI. 


Since the block code is Reed-Solomon, n is constrained by 
n = log, (N + 1). The overall block length (reflecting the complexity 
of the hybrid coder) is nN. Thus the minimum possible value of n 
behaves as the logarithm of the overall block length, and the average 
computation per bit increases as the (a’/a — 1)th power of the loga- 
rithm of overall block length. Furthermore, ifr < R.omp then a’ = a > 1, 
and the average computation per bit is independent of the overall block 
length. 

For rates above R.omp , the exponent a’/a increases rapidly with rate, 
approaching infinity at channel capacity. Thus the bound on the 
average computation, although finite, increases very rapidly with rate 
above R.omp - The average computation observed in the simulation 
reported in Reference 19 did indeed increase very rapidly with rate 
above Romp - 


3.4 Probability of Buffer Overflow 


A new super block arrives to be decoded once every nvr seconds. 
Each of the N sequential decoders is provided with a buffer which 
is assumed to store the latest Bu received output symbols from its 
respective DMC. Since we have assumed that all symbols comprising 
a super block must have been received before any decoding of the 
super block can start, the total storage must be large enough to con- 
tain one or more super blocks, that is, B must exceed n. Whole or 
partial super blocks stored but not yet decoded form a queue. 

If the queue exceeds B/n super blocks (Bu channel output sym- 
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bols per DMC) buffer overflow occurs. We wish to upper bound 
P,(B), the probability that the buffer overflows before the first L 
consecutive super blocks are decoded, given that the decoder starts 
with initially empty buffers. 

Let q; be the number of undecoded super blocks in the queue just 
after 1 consecutive super blocks have been decoded. Let X; be the 
number of new super blocks which arrive to join the queue during 
the decoding of the ith super block. Because of our convention that 
decoding of any super block does not begin until the entire block 
has joined the queue, the number X; does not include the ith super 
block itself or later super blocks. The random variables X; and q; 
are not necessarily integers, since a fraction 1/n of a super block 
arrives to be decoded every vr seconds. 

When decoding of the first super block starts, the queue consists 
of only the first super block. Just after the first super block is decoded, 
the queue is thus diminished by one but has been increased by XX}. 
Thus . 


m=1l-14+%X%,=%X. (24) 
Just after the second super block is decoded, 
= j > 
fa \* 1+X, if w21 (25) 
X» if gq <1. 
This is upper bounded by q, + Xz for any q,; = 0. Therefore 
@s X, + X2. (26) 
Similarly, 
1 See if gq 21 
;= 
Xs if g@<1 
=< X,+X.+ xX; forany q 2 0 (27) 
By induction then, 
GS DX. (28) 
=1 


This upper bound increases monotonically with 2. It is clearly a crude 
approximation for large 7. However it will turn out to yield a theoret- 
ically interesting upper bound on pz,(B), at least for values of DL 
which are small relative to B. 
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p(B) =pr(a+1l2B/n) or (+12 B/n) or 
(qc + 1 2 B/n)] 


= pr [max {q,, @, °°: , dz} 2 (B— n)/n] 
<pr| Oxe2 @- m/l: (29 


This inequality follows from (28) and the fact that all X, = 0. 

Suppose each sequential decoder is capable of doing up to » computa- 
tions in each v7-second interval, during which time a new branch arrives 
in each buffer. The parameter » must be several times greater than the 
average number of computation units that the hybrid decoder does per 
information bit, if the decoder is to keep up with the incoming data. 
The hybrid decoder is ‘‘busy’”’ (doing exactly » computation units every 
vr-second interval) until it is about to start decoding a super block 
which has not yet completely entered the buffer. From that instant it 
is idle until the entire super block has entered the buffer, at which time 
it becomes busy again. Thus, a busy interval can only be initiated just 
after the arrival of some super block, and can end only upon completion 
of the decoding of some subsequent super block. Suppose that during 
a particular busy interval, the vth through (» + 7)th super blocks are 
decoded (y, » integers; L = v 2 1, » 2 0). Let C, be the number of 
computation units to decode the /th super block. Thus >_?*? C, is the 
total number of computation units done during the busy period. The 
first new super block to arrive during the busy interval arrives after 7 
computation units have been done; thereafter, super blocks arrive every 
yn computation units. Thus (1/yn) 527+? C, super blocks arrive during 
the entire busy interval. Successive busy intervals do not overlap, and 
therefore until the Zth super block is decoded, 


L L 
2 X, S (1/un) > C,. (30) 
Thus, from (29), 


p(B) S pr Pp Ce 2 vB — » |: (31) 


Since coding and decoding is independent from one super block to the 
next, the random variables {C, , ¢ = 1, 2, --- , L} are statistically 
independent, and have a common cumulative probability distribution 
function which is bounded by the asymptotically-pareto distribution 
function (20). 
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The probability that overflow occurs before the first block is de- 
coded is 


p(B) spri.2@-m<[aes|". ee 


In appendix B an upper bound is obtained for the probability distri- 
bution of a sum of LZ statistically independent pareto-distributed ran- 
dom variables.* If the distribution of each random variable is upper 
bounded by pr (C; 2 X) S (A/X)’, s > 1 then it is shown that 


pr| Dez] < doae/sy’, (38) 


where D = 1 + e°. This bound is valid for values of L which are small 
relative to y; specifically, for 


(LA/y)in(y®/A*L)tn(y/A) < e. (33a) 


Applying inequality (33) to (31), we obtain the following bound 
for the probability of buffer overflow before L super blocks are 
decoded: 


p(B) < pi| ae] Néa > I (84) 


provided that 


LA, [u(B — n)]"** B=”) oa 
Tea eo fe ara 


Condition (84a) will be satisfied for values of LZ which are small 
relative to the product of decoder speed and available buffer size 
p(B-n). Inequality (34) then indicates that p,(B) tends to increase 
linearly toward one with LZ and to decrease asymptotically as the 
negative (NVéa)th power of »(B-n). 

The techniques used to bound p,(B) were too crude to yield a useful 
result for small values of »(B-—n) or relatively large values of ZL; if 
condition (34a) is not satisfied, p,(B) can only be estimated by heu- 
ristic reasoning. The waiting line of undecoded super blocks can 
increase during the decoding of a given super block only if C, the 
number of computation units to decode the super block exceeds the 
number of computation units the decoder can do in nvr seconds, that is, 


* Jelinek has given a more easily derived upper bound, which in its dependence 
on L, is at least as tight as our bound forl Ss = 2 
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if C > np. The probability that the queue increases is bounded by 
(20) with X = np. 


pr (C > np) < (Ai/nu)”**. 


If the decoder’s speed factor » is made large enough so that np > 
(C)ay Where (C)ay is bounded by (22), then the probability that the 
queue increases during the decoding of any super block approaches 
zero as N approaches infinity. Then the queue would be expected 
to remain close to zero most of the time, and consequently the proba- 
bility that the buffer overflows during the decoding of a given super 
block would be approximated by p,(B), the probability that the first 
super block causes buffer overflow. For this reason, we use p(B), 
bounded by (32) as a measure of buffer overflow probability. 

It was shown in Section (3.3) that if N&a > 1, the mean computa- 
tion per super block is bounded by 


Néa 
(Chis s Néa oa 1 A, ° (22) 


A hybrid decoder which can perform at least (C),, computation units 
in a nur-second interval can, on the average, keep up with the in- 
coming stream of super blocks arriving at nvr-second intervals. A 
necessary condition for np >(C)ay is 


eS em Ley erent CNET AP (35) 


where p, is any number greater than N8a/(N8a — 1). 
Under condition (35) »1(B), given by (82), is bounded by 


n Nba 
pi(B) < lat | : (36) 
A fairly realistic measure of the cost of the hybrid scheme is the 
total amount of buffer storage utilized. If each of the N individual 
sequential decoders has a buffer capable of storing Bv channel output 
symbols, the total number of symbols which can be stored is 

S, = BNo., (37) 


Suppose we set 


B= n(1 + e/u). (38) 
Then 


714 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


p(B) < exp [—Néa] 
= exp [—S, da/Bv] 


—S, ba 

ae E + al oe 
For very large values of N (and therefore of S;), the necessary 
value of serial block length n increases no faster than log.S; to fulfill 
the constraints (4), (87) and (38). Consequently, the buffer overflow 
probability »:(B) is bounded by a quantity that asymptotically 
decreases almost exponentially with the total decoder storage S; (that 
is, as S;/loge S;). Furthermore, the exponent in (39) is positive pro- 
vided that 6 > 0 and a > 0. These conditions may be met for any 
overall rate R’ which is less than channel capacity if the tree code 
rate r is less than channel capacity, and 8 is small enough so that 
condition (8) is fulfilled. The derivation of this result suggested that 
best use would be made of a large but fixed amount of buffer storage 
if the number of parallel sequential coding-decoding systems is as 
large as possible, while the amount of storage allocated to each is a 

relatively small fixed multiple of the serial block length n. 


[V. A NUMERICAL EXAMPLE 


The upper bounds of the previous section are generally useful only 
if one is interested in asymptotic performance. Calculation of per- 
formance parameters for an implementable system should be based 
on the results of simulations. In this section we illustrate the estima- 
tion of performance parameters, based on a simulation of a sequen- 
tial decoder. 

Reference 25 describes the computer simulation of a Fano algorithm 
sequential decoder which decodes convolutionally coded binary anti- 
podal signals received from a quantized phase-coherent white gaus- 
sian noise channel. For a convolutional code rate r = 1/7 bits per 
channel use, a signal-to-noise ratio of —6.5 dB, and an 8-level channel 
output quantization scheme, the pareto exponent « was very close to 
unity, that is, Reomp was close to 1/7. Other parameters are: 


(t) serial block length n = 360 branches 
(tz) number of redundant branches per serial block m = 24 
(%zi) convolutional code constraint length = 24 branches. 


The net information rate of this system was then 


1 360 — 24 
7 860 = 0.133 bits per channel use. 
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Assume the following RS code parameters 


(t) Block length N = 31 = 25 — 1. 
(iz) Alphabet size = 32 = 2°, so that each super block is a se- 
quence of 72 RS code words. 
(iw) Rate R = 26/31, so that 5 serial blocks out of 31 are check 
symbols. 
(tv) The RS decoder is designed to correct no errors and up to 5 
erasures per RS code word. 


The RS decoder would be easy to implement. A 155-bit register is 
required to store a RS code word consisting of 31 32-ary symbols. In 
addition, circuitry must be provided to solve 5 parity check equations 
to find the values of up to 5 erased 32-ary symbols. Forney has de- 
scribed efficient techniques for finding values of erasures.t®° The number 
of RS decoding operations is on the order of the square of the number 
of erasures which can be corrected. 

Reference 25 shows empirical probability distribution functions for 
the total number of computations per serial block as observed in the 
simulation. For example, for the —6.5 dB channel, the probability that 
c, the number of computations per serial block exceeds 36,000 is approxi- 
mately 10°’. Thus the probability pr (C = 36,000) that the number of 
computation units to decode a super block exceeds 36,000 equals the 
probability that 6 or more of the 31 serial blocks require more than 
36,000 computations. This probability is obtained from tables (8. 
Weintraub, Tables of the Cumulative Binomial Probability Distribution 
for Small Values of p, London: Collier-Macmillan, 1963). 


36 
pr (C = 36,000) = > | pia ~ p)§ = 6X10 (p = 10%). 


i=6 Lt 


Now assume that each sequential decoder is fast enough to do p = 
50 computations between received branches. Then, up to 360n = 18,000 
computations can be done by each decoder in the time taken for one 
new serial block to enter the buffer of each; hence if each sequential 
decoder has a buffer with a storage capacity of three serial blocks, 
the buffer storage will overflow (starting from the initially empty 
state and assuming that decoding of a block starts after it is within 
the buffer) if the first super block requires more than 2 X 18,000 = 
36,000 computation units. Then, assuming overflows are rare enough 
to be nearly statistically independent, the buffer overflow probability 
per super block would be about 6 x 10-7. Each decoder’s buffer stores 
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3 X 360 = 1080 received branches, and the total number of branches 
stored is thus 1080 N = 33,480. The total number of bits (one per 
branch) per super block is 860 x 31 = 11,160. 

Taking n6 = m = 24,0 = 7, E,(r) logs e & Reomp & 1/7, and assuming 
that A, = 1 and that the upper bound (9) holds for convolutional codes, 
we have a rough upper bound for p,(e), the probability of undetected 
error per serial block. 


pile) < 360 X 2°** = 2.23 * 10°. 


(In the simulation, none of 1331 decoded blocks contained undetected 
errors. ) 

The probability of an undetected error for a super block is the 
probability that one or more of the 31 serial blocks has undetected 
errors; this probability is upper-bounded by 31 X 2.23 x 10° = 
6.9 xX 10-*. This probability may be considered too high. It may be 
decreased about 3 orders of magnitude by increasing the value of 
m from 24 to 34. The resulting increase in the serial block length 
from 360 to 370 should cause negligible effect on the distribution of 
computation per serial block. 

The net information rate of this system is rR (n—m) /n = 0.109 bits 
per channel use. It can be shown that the required signal-to-noise 
ratio per information bit is about 4.7 dB above Shannon’s theoretical 
minimum for the infinite bandwidth white gaussian noise channel. 

By such simple calculations based on extensive simulations, one 
can optimize the parameters of a hybrid scheme to meet given cost 
and performance criteria. 


Vv. CONCLUSIONS 


In the hybrid decoding scheme the number of decoding computation 
units per super block is a random variable, reflecting the probabilistic 
character of the sequential decoders’ operations. However the pareto 
exponent is proportional to N; the frequency of large peaks of compu- 
tational effort is reduced by algebraic decoding of the occasional serial 
blocks which otherwise would require excessive sequential decoding 
computation. 

It was shown that for any overall information rate that is strictly 
less than the channel capacity, a finite minimum value of parallel block 
length N can be specified such that the average number of sequential 
decoding computations per bit is bounded by a quantity varying as 
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n*'/*- where a is the original pareto exponent for the sequential 
decoding components and a’ = max {a, 1}. The number of algebraic 
decoding computations per bit is a fixed number which is almost inde- 
pendent of parallel or serial block length.° 

It was also shown that for a proper choice of parameters, the error 
probability decreases nearly exponentially with the overall block length, 
and (heuristically) that the probability of buffer overflow asymptotically 
decreases almost exponentially with the total amount of storage at the 
decoder. These results can hold for any overall information rate which is 
strictly less than the channel capacity. 

A rigorous upper bound was also obtained on p,(B), the probability 
that the buffer overflows before L super blocks are decoded. The bound 
is valid for u(B — n) > L, and behaves as L[A,e/yB]"** for B >> n and 
fixed effective pareto exponent Néa. 

The hybrid scheme shares the multistage feature with the schemes 
of Ziv, Pinsker, and Forney.1*-!* In Ziv’s scheme, there is an in- 
termediate stage in which errors made by the inner block coding 
stage are detected and treated as erasures. After a scrambling-de- 
scrambling procedure these erasures are corrected by an outer block 
coding-decoding stage. Forney’s scheme has two stages; a large alpha- 
bet RS code outer stage corrects errors and/or erasures made by an 
arbitrary inner block coding-decoding stage. Pinsker’s scheme utilizes 
sequential coding-decoding for the outer stage. The principle is that if 
the inner stage has a sufficiently low error probability, the rate Reomp 
seen by the outer stage is little different from channel capacity. (This 
is, in a sense, the inverse of our hybrid scheme.) 

In the hybrid scheme described in this paper, the inner and outer 
stages embody sequential (probabilistic) coding-decoding and alge- 
braic coding-decoding respectively. Sequential coding and decoding is 
practical to implement and is efficient for any given DMC, which 
might be created from a physical communication channel by effi- 
cient modulation, demodulation, and quantization.?© ?* The number of 
computation units per super block is a random variable, reflecting 
the probabilistic nature of sequential decoding and of short-term 
channel behavior. However, the variability of the sequential decoding 
computational load is eased substantially by the outer (algebraic) 
stage. Thus, in contrast with previous multistage schemes, the outer 
decoding stage assists the inner decoding stage, as well as correcting 
its errors. 

Modifications and generalizations of the hybrid scheme are pos- 
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sible. A related scheme, in which channel symbols are not organized 
into independently coded blocks, was studied in Reference 19. An- 
other modified hybrid scheme, falling into the general class of con- 
catenated schemes considered by Forney, is implemented by imposing 
an upper limit X, on the number of computations any sequential 
decoder can do on a serial block.1* Assuming the speed factor p is large 
enough that X, computation units may be done in the time taken 
to receive one super block, no queue of undecoded super blocks can 
build up, and the buffer overflow problem is eliminated. Instead, any 
super blocks requiring more than X, computation units are passed on 
to the user as erasures. The probability of erasure is then bounded 
by the right-hand side of (19) with X = X, > Ai, that is, it de- 
creases exponentially with parallel block length N. 

The multistage approach embodied in the hybrid scheme would 
also appear to be useful for real channels with memory, where errors 
or severe channel disturbances occur in bursts, usually separated by 
fairly long intervals with only scattered random errors. If the N 
serial blocks comprising a super block are transmitted consecutively, 
a burst occurring during the transmission of one or more consecutive 
serial blocks would likely render them nearly undecodable by se- 
quential decoding. Then if the burst did not extend over more than S 
serial blocks, an outer Reed-Solomon or other burst-correcting stage 
could correct the resulting erasures. The application of hybrid or 
other multistage coding schemes to real channels with memory is an 
interesting area for future investigation. 

Any “hybrid” or “concatenated” coding-decoding scheme, incor- 
porating a number of separate parallel coders and decoders would 
likely be orders of magnitude more complex than present day coding- 
decoding schemes for discrete memoryless channels. However the ad- 
ditional complexity may be a tolerable price to pay for the benefits 
of increased reliability and more efficient utilization of the commu- 
nication channel. It is also well to remember that highly complex 
digital systems are becoming increasingly feasible as a result of rapid 
progress in integrated circuit technology. 
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APPENDIX A 


Bounds on Performance for Sequential Decoding 


Various upper bounds on the probability of error and the distribu- 
tion of computation for the Fano sequential decoding algorithm have 
been given in References 20, 21, 22, and 28. All these bounds were 
obtained by random coding arguments, that is, by averaging over an 
ensemble of tree codes with a given probability distribution. The 
results apply to an arbitrary DMC with a P-symbol input alphabet 
and Q-symbol output alphabet, and a transition probability matrix 
{qi}. We shall summarize some of these previous bounds and then 
shall relate them to the performance of the hybrid scheme. 

Using Gallager’s notation, we define the function 


Q P l+p 
Ep) = —tn >) | Spat | ; OSp<e 


7=1 i=1 
where 


{pi} 1 = 1, 2,..., P is the probability distribution on the channel 
input symbols which maximizes E,(p) .* 


It can be shown that #,(p) is a nondecreasing function of p, that 


E,(0) = 0 
and that 


lim = B,(p) = C, 
p70 P 
where C, is the capacity of the DMC in bits per channel use. 

Any transmitted serial block is a sequence of nv channel input 
symbols which label the corresponding correct path through the 
code tree. A path which diverges from the correct path is termed an 
incorrect path. A sequential decoder makes an undetected error at 
some node lying on the correct path, if the pattern of channel symbol 
transitions causes the decoder to reach the end of the serial block 
while on some incorrect path stemming from that node. One or more 
branches following the node will then have been decoded incorrectly. 
Of the n coder input bits which generate a serial block, m (the final 
m) are known to the sequential decoder. Hence a necessary condition 
for an undetected error to occur in decoding any serial block is that 
an incorrect path exists whose corresponding sequence of coder input 
digits matches that of the correct path in m or more places, and 
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which the decoding algorithm can follow past those m places. The 
probability p,(e) that this necessary condition is fulfilled for say 
the hth node lying on the correct path has been upper-bounded by 
Yudkin, by random coding arguments.?° 


ple) < A. exp [—mvE,(r)] (40) 


where A, is a constant and E,(r) > 0 forO0 S r < C, . The exponent 
Er) is sketched for a typical DMC in Fig. 6. It is considerably greater 
than the unexpurgated error exponent for block codes with the same 
rate. In fact, Z,(r) = E,(1) for rates below r = E, (1) log. e bits per 
channel use. This result for convolutional codes was also shown by 
Viterbi.”” The probability of error p,(e) for a serial block is upper 
bounded by the probability that the necessary condition for undetected 
error occurs for one or more of the n nodes on the correct path. By 
the union bound, 


ple) S 3 pe) S$ nA, exp [—mE,(r)}. (41) 


Inequality (9) follows from this result with né substituted for m. 

Consider tree codes of rate r bits per channel input, where the tree 
extends infinitely to the right. The incorrect subset of the hth node 
lying on the correct path is defined to consist of that node plus the 
infinite set of nodes lying on incorrect paths which stem from the Ath 
node. Let y, be the total number of computations (examinations of 
branches) ever done on nodes within this incorrect ubset. Then -y, iss 
a random variable over the ensemble of tree codes and channel transi- 
tion sequences. The sth (s > 0) moment of y, is bounded by a fixed 
quantity Aé for rates r such that 


r < (L,(s)/s) loge e. (42) 


The quantity A‘ isa function of s, 7, {p;} and {q;;}. This was established 
for integral values of s by Savage, for all s 2 1 by Yudkin, and for 
0 < s S 1 by Falconer.”’”~” In particular, note that the mean of y, 
is only bounded for r < (1) log, e. The quantity H,(1) for a DMC is 
also denoted by Ryomp , that rate below which the mean computation is 
finite. This bound on (y;),y leads to an upper bound on the probability 
distribution pr (y, 2 x) by use of the Chebyshev inequality.” 


pry, & 2) S yi)et s>0 
< Aix™* rr < (E,(s)/s) log, e. 


(43) 


HYBRID CODING SCHEME 721 


The pareto exponent « is defined parametrically by 


r = (E,(a)/c) log: e. (44) 
Then for any e > 0 


pry = 2) = (4/2). (45) 


The right-hand side of (45) is proportional to a pareto probability 
distribution. The positive quantity « may be made arbitrarily small 
by setting A, large enough. Henceforth, we shall ignore « as trivial 
since it would not affect our asymptotic results. Thus, we write 


ex = AS (46) 
and 


pr (y, 2 zt) < (A./2)* (47) 


where 


E,.(a)/a = r/log. e, 


where the rate r is in bits per second. The pareto exponent for a 
typical DMC is shown in Fig. 7. The exponent on the right-hand 
side of (47) agrees asymptotically with that of a lower bound on the 
distribution of sequential decoding computation derived by Jacobs 
and Berlekamp.”* 

Let us now relate this upper bound on the distribution of computa- 
tion for the Fano sequential decoding algorithm to the sequential 
decoding of serial blocks in the hybrid system. Only the portion of 
the code tree to a depth n branches from the origin is used to code 
and decode a serial block. Furthermore the last m information digits 
are known to the sequential decoder. Truncating a tree at a depth of 
nm branches and making known the final m information letters can 
only reduce the number of branches a sequential decoder must exam- 
ine before completing all computations in the first n incorrect subsets 
of an infinitely deep tree. Furthermore, it can be shown that for 
the Fano sequential decoding algorithm, allowing the decoder to 
search branches beyond depth n cannot reduce the number of com- 
putations ultimately done within a depth of n branches. Therefore, 
if c is the total number of computations to decode a serial block, 


cs per ~ (48) 


hel 
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and 


Cia S (p> nm} > (a > 0). (49) 


The right-hand side of (49) may be bounded with well-known in- 
equalities,?® 


n 


n a > (yay 0 < a = 1 (50) 
(Ex) <4 
hel av n a 
Ds (oat) aZzl (51) 
=1 
Since 
(yew < AZ; O<a<o, forall h, 
we have 
(en. < fee 0<aKd<l (52) 
n° At al (53) 
or 
(Ce <n AG (54) 
where 


a’ = max (1, a). 


Then pr (c = x) is bounded using Chebyshev’s inequality 
pre = 2) S Cue" Sn (A./2)° (55) 


where a@ is given parametrically by r = [E,(a)/a] log, e. 
APPENDIX B 


Probability Distribution of a Sum of Independent Pareto-Distributed 
Random Variables 


It is required to upper-bound 
L 


where the {C,} are a set of independent positive integer-valued ran- 
dom variables whose distribution is asymptotically bounded by a 
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pareto distribution function 
(A/x)’, ge A 
1, 0O<2<A 


pr[C; 22] Ss (56) 


where s is greater than one. The following assumption will be found 
necessary 


[ 
(A/y) fn (y'/A'L) tm (y/A) < = (57) 


This assumption is tantamount to requiring that y be large relative 
to DL. 

We shall split the required probability into two parts, one of 
which is bounded by a union argument, and the other by use of a 
Chernoff technique (Reference 12, p. 97). That is, we write 


L 
pt| Soe u|=n+p. (58) 
7=1 
where 
L 
Pp. = pr > C;2y; oneormoreof {C;} = v| 
t=1 
L 
P2 = pr| > C,;2y; all {C,;} < v|- 
t=1 
But 


p, < pr [one or more of {C;} = y] 
L 
< UprC.2y (59) 


t=1 


by the union probability bound. So, substituting (56) into (59), we get 


p,<L(A/y’ y>A>il 
s> 1. 


(60) 


The probability pe may be bounded using the Chernoff technique, 
since each random variable C;, being upper-bounded by y, has a finite 
moment generating function. To bound pz we first define 


fas = pr (C; = 2;) 2 =1,2,--- ,y; (61) 
4=1,2,---,L 


724 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


and 


eG) S ‘ =, (62) 
0 z#2£<0 


Then by definition, 
y-1 y-1 y-1 L 
p= St Sta Stet] Say] 


We upper-bound the step function ®(x) by the exponential function 
exp(Av), where » is an arbitrary positive quantity. We shall later 
choose a convenient value for ». The right-hand side of (63) can 
now be bounded by a product of sums. 


P2 S > fs, = Fa bs fe, EXP Rew = ) | 


zi=l za=1 L=1 t=1 


exp (—Ay) Il | 5 LOxp 02) | (64) 


i=] zi=1 


exp (—)y) |= f. exp os) | (65) 


z=] 


since the random variables {z;} are identically-distributed. 
Now let 


y-1 : 
v= Dif. exp (2). (66) 
This may be expressed in terms of the distribution function pr (C = 2). 


f,=pr(C =z) =pr(C 22) -—pr(C22+4+1)) (67) 


So, 
y= Dew dir 2) —p C224 v) 
=1+ D lex 02) - ep OG - Np C Ba 
— exp((y — 1) pr( 2 y), (68) 
since 
pr (C = 1) = 1. 


Taking out the common factor [1 — exp(—A)] and upper-bounding 
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it by A, we get 


Pease CA > exp (4) pr (C = 2) 


<1+) p exp (Az) pr (C = 2). (69) 


The function y is further bounded by employing the upper bound 
(56) for pr (C 2 2). 


y<1+) > exp (Az) + > exp (Az)(A/z)*. (70) 


We now express the exponential functions as convergent power series 
and interchange the order of summation to yield 


y<1i+t Yd exp (dz) -+ ee veh > ae (71) 


z=A+l1 


The sum over zg may be upper bounded by an integral, which can 
be evaluated and bounded by simple expressions 








At 
< < sj— 
roea 1shSsj-1 
y-1 y h-s =, 
girhes < / ghios dz < A n (y/A) s 1 < h = a (72) 
z=aA+l1 A y 8 <— h < 8 + 1 
h-s 
y 
| b> spl 


These bounds will be used to bound the right-hand side of (71). 
The first sum in (71) is bounded by the number of terms times the 
largest (last) term. 


> d exp (Az) < A exp (AA). (73) 


Therefore, defining h, to be that integer for which hh + 1 > s 2 A, , 
we have 


wv <1 -+ A exp (AA) + ps acbiecn t pom (y/A) 
+AU ay SS OM cy 


h=ho+2 h! (h _ S) 
In the final sum in (74), h 2 h, + 2 > s + 1, and hence the sum may 
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be upper bounded by bounding h/(h — s) by h, + 2 and then extending 
the summation down to h = 0. Thus, 


(Ah) > Wt. J 5 he + 2(A/y)" > Our 


h=hot+2 


= (h, + 2)(A/y)’ exp (Ay). (75) 


Furthermore in the first sum in (74), 2 S hp — 1 S s — 1, and hence 
the sum may be bounded by bounding 1/(s — h) by 1 and then ex- 
tending the summation to infinity. Thus 


> aap <A 3B ay =A exp(\d). (76) 


Sinces 2 hy, 


y < 1+ 2X4 exp (AA) + (8 + 2)(A/y)’ exp (ry) 





QA)* oe 
+ Ge pi W/4) +S wy. (77) 
We shall now choose a particular value for A: 
1 & 
=r, = 4m (24). (78) 


We also assume that DL is small enough relative to y so that 


Alin (y/A)] < + (79) 


This assumption is equivalent to (57). This condition also ensures 
that AA < 1/eL < 1. The terms of (77) may now be bounded 
separately to yield a convenient upper bound on w. Thus, 


QA exp (HA) < 2/L. (80) 
From (78), 


(s + 2)(A/y)’ exp (Ay) = (s + 2)/L. (81) 
Finally, using (78) and (79) it is easy to show that 


Oa 


The function y is now upper ae for A = A, by using (80), 
(81), and (82) in (77), 


Pe PAR ash Ge DL 1646 48)/L. (83) 


Boy a Lape O7d, (82) 
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Then, 


“= exp og | < [1+ 6+ s)/Z)’. 
= 0 


Now for any L, a 
t+ a/Ze =1 404+ 2S —4 ayy 


eee Ce 
= 1 


<1l+ta+a’/2!+ --- = exp (a). 
Hence, 
vy” < exp (6+ s). 
Substituting (86) in (65), we obtain 


D2 < L(Ae/y)’ exp (6). 
Finally, after substitution of (87) in (58) and (60), 


pr| Deve v| < Deady 


t=] 


where 


=1+é. 
This bound is valid under the condition (57), 


(A/y) fn (y’/A°L) fn (y/A) < 1/eL. 
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Convolutional Reed-Solomon Codes 


By P. M. EBERT and S. Y. TONG 
(Manuscript received July 28, 1968) 


We derive a new family of convolutional character-error-correcting 
codes which are a convolutional form of the Reed-Solomon block codes and 
as such have nonbinary symbols. We also derive a bound on the error cor- 
recting capabilities of these codes in which the error-correcting capability 
per constraint length grows approximately with the square root of the 
constraint length. 

When these codes are used on a binary channel they are effective for both 
random and burst error correction because a single character spans several 
channel digits. 

These codes have greater error-correcting capabilities than the Robinson- 
Bernstein self-orthogonal codes but are harder to decode. The single- 
character-error-correcting codes, when interleaved, are shown to be more 
powerful than the equivalent Hagelbarger code and appear to be simpler to 
implement. They are also slightly better than the interleaved version of 
Berlekamp’s code. 

We discuss encoding and decoding algorithms and illustrate a simple 
decoding algorithm for some of the codes. These codes are closely related 
to the Bose-Chaudhuri—Hocquenghem block codes and share with them the 
decoding simplification for character erasures in place of errors. Any 
Bose-Chaudhuri-H ocquenghem decoding algorithm can be used to decode 
these codes. 


I. INTRODUCTION 


This paper is concerned with a family of character error correcting 
convolutional codes which are derived from Reed-Solomon block 
codes. The derivation of the error correcting capabilities is easy be- 
cause of the algebraic nature of the code; the convolutional nature of 
the code allows the use of a simple encoder even at high rates. Also, 
sequential decoding techniques might be applicable. 

Throughout the paper we use elements of a finite field as symbols 
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instead of just 1 and 0. The elements can be represented by k-tuples 
of ones and zeros if the field has 2* elements; these k tuples are called 
symbols or characters. Thus with this code we are able to correct 
character errors which may be bursts of binary errors. All that one 
need know about finite fields is that each nonzero element has an in- 
verse and that there exists at least one element which when taken to 
successive powers will generate the entire field with the exception of 
the zero element. This is called a primitive element. 

The capability of the codes is given by the number of errors that 
can be corrected within a fixed number of characters (the constraint 
length). Suppose a code can correct three errors within a constraint 
length of 12. Then the code can correct any pattern of errors as long 
as no sequence of 12 characters has more than three errors in it. In 
the context of error correcting ability one can define a minimum dis- 
tance of the code. For linear codes the minimum distance (d), is 
equal to the minimum weight code word segment, one constraint 
length long, which has a nonzero first character. With this definition 
the code can correct up to (d-1)/2 errors occurring within one con- 
straint length. 

The rate of a code, R, is the fraction of characters used as informa- 
tion symbols. 

A convolutional code has its check symbols formed as a convolu- 
tion or weighted sum of a fixed number of the past information sym- 
bols. The weighted sum is formed in the algebra of the finite field. 

The codes described in Section II are capable of correcting t char- 
acter errors within a constraint length of (20? — t + 1)/(1 — R) 
channel characters. The channel characters must be elements of a 
finite field of size at least [R(2t — 1) (¢ — 1) + 1]/(1 — RB). If 
one uses as channel symbols elements of a much larger field it is pos- 
sible to construct a code which will correct ¢ errors with a constraint 
length of only 2¢/(1 — #). Those codes can be encoded by a stand- 
ard convolutional encoder or by a number of accumulators which can 
also perform multiplication. Decoding can be accomplished by a 
modified Bose-Chaudhuri-Hocquenghem decoder. 


II. CONSTRUCTION OF THE CODES 


For a code of rate R = (6 — 1)/b one can use every bth symbol as a 
check symbol. To define the code completely one need only give the 
weights used in the convolution of past channel symbols. For con- 
venience we put these weights in an NV by b = 1/(1—R) matrix, B. B; ,;= 
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W,-1+.; (The constraint length is Nb.) If x; is a check symbol we write 


bN-1 
t= — 2) o-.W,, t= 6, 2b,--- 
k=1 
or 
bN-1 
D» 2-.W, = 0, W, = 1. (1) 
k=0 


Following Wyner and Ash we take the N by 6 matrix called B and form 
it into an infinite A matrix by shifting B to the right b places and 


down one place:” 
00 00 
Bi i" 
B : 
a mal I» ; 
00 00 . 


Then by (1) any code sequence written as a vector x will satisfy 
Ax = 0. (2) 


We define the code by defining the elements of the matrix B. B is used 
instead of the weights W; because the notation is clearer. Denote the 
elements of B by B;; . Then let B;; = B;yi~’ where the b y,’s are o°, 


a’, +++, a”, 0. The symbol a is a primitive element of the field, and 
0° is taken as 1. The y,’s are called locators. Let: Bj = a? °°), 
where 


;—2 . . 
ae, 0G- DU—2). 
j= dX a 5 
Thus for any b, N, and finite field a code is defined. 
As an example we take the special case which was presented by 
Wyner, where N = 2 and the Bj; are given by? 


eee yim DGHD) pez 
ii ou» t =, b 
B= |) ie. ee 1 a 3) 
laa -<--- a 0 


This code has d = 3 and thus can correct single errors, or double 
erasures in a constraint length of 2b. It has the additional advantages 
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that it can be easily implemented, it is optimal,* and it has no error 
propagation. We describe, in Sections III through V, the implementa- 
tion and some properties of the single-character-correcting codes. 


III. DECODING TECHNIQUE FOR SINGLE CHARACTER CORRECTING CODES 


Single errors are particularly easy to correct because the first 
syndrome (difference between the calculated check character and the 
received check character) is equal to the error, and the second syn- 
drome is equal to the error multiplied by the error location. Since 
we take 


i-l 


YiFa is, 4b (4) 
7; = 0, 1~=6b 


the error location 7 can be found by dividing the second syndrome 8, 
by a@ and comparing the result to the first syndrome, So. This is 
repeated until they agree. The division can be implemented by a 
shift register whose feedback path corresponds to the coefficients of 
g(x) the generator polynominal of a, the primitive element.* 


IV. HARDWARE IMPLEMENTATION FOR SINGLE CHARACTER CORRECTING CODE 


The implementation is described through the example: symbols in 
GF (2*") k = 2, b = 4. The elements of GF(4) are represented as 
binary 2-tuples. Since x? + x + 1 is a primitive polynomial in GF (2), 
division by a can be instrumented by a two-state shift register with 
appropriate taps. The decoder takes form of Fig. 1. The entire re- 
ceived vector is shifted into the data buffer and then the syndromes 
So and S, are computed and stored in two registers Ry and R,, respec- 
tively. So identifies the error pattern. If So is nonzero, it is assumed 
that a single character error of the pattern shown occurred. For each 
character shifted out of data buffer the R, register is shifted once 
with the feedback path connected (which corresponds to a division 
by a). The error pattern Sp in Ro matches that contained in register 
R, when the erroneous character just emerges out of the data buffer. 
This character need only have Sp subtracted from it to complete cor- 
rection. It is possible to do all of this with simple logic circuitry and 
a storage of 2bk bits. 


*No other code, of the same constraint length, which corrects single errors, 
can have a higher rate (that is, fewer check symbols). 
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DATA 
FROM DATA 
CHANNEL SINK 


Fig. 1— Decoder for single error correcting code. Go and G: are timing signals: 
G> is high during the time information characters appear; G: is high when parity 
character appears. 


V. PROPERTIES OF THE SINGLE CHARACTER CORRECTING CODE 


Observe that for every b character one must decide, based on the 
current syndromes of 2k bits, the error pattern and the location of 
the error. Since there are b possible error locations one needs at least 
[logeb] = k bits for identification if 2*1 < b, and as there are 2” error 
patterns in a character (including the no-error pattern) that calls for 
k bits of information; thus the lower bound on the number of syn- 
drome bits is 2k which shows the code is optimal* when b > 27-4, 

Notice that since the syndrome is reset to zero after correction (cor- 
responding to the removal of the effect of the error on syndrome) no 
error propagation is possible. 

Interleaving can be applied to this class of codes to achieve a class of 
near-optimal burst-error correcting codes. If the interleaving degree is 


* It is optimal in the number of check symbols. 
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m then two adjacent characters in the original code are separated by 
m — 1 characters or L = k(m — 1) binary digits. A burst of length L 
+ 1 bits will never affect two adjacent characters of the original codes; 
hence the interleaved code is capable of correcting an L + 1 bit burst. 
Since the original code has a storage requirement of 2bk bits, the in- 
terleaved code requires, at most, 2kbm bits. Observe that to correct a 
particular character one need not store the last character of the other 
m — 1 interleaved codes; therefore the storage requirement is 2 bkm 
— (m—1)k = 2 bk(1 + L/k) — L. 

The required guard space is simply one less than the constraint 
length.* This can be seen by observing that in correcting a burst the 
syndrome must be set to zero when the last character in the burst is 
corrected. It follows that the guard space must always be shorter 
than the constraint length. Table I compares some members of this 
class of codes with some Hagelbarger burst-error-correcting recur- 
rent codes as well as the Berlekamp burst-error-correcting recurrent 
codes (assuming Massey’s decoding algorithm) .*-? Table I shows this 
class of codes requires less guard space and hence is more powerful. 


TABLE I—CoMPARISON OF BuRST CORRECTING CoDES 


Decoder cost* Guard space 
Rate Burst A B Cc A B Cc 
R= 1/2 20 44 _— 61 61 — 60 
19 —_— 60 58 — 61 57 
R = 2/38 21 132 —_ 112 170 — l11 
22 — 120 —_— — 122 —_ 
R = 3/4 20 183 —_ _— 223 — _— 
21 — 168 156 — 171 155 
R = 4/5 20 326 _— —_ 384 — _— 
21 —_ 225 _— _ 229 _— 
22 _— — 219 — _ 218 


* More precisely, the number of shift registers. 
Note: A — Hagelbarger codes. ; 
B- poms type B2 burst-error correcting codes modified by inter- 
eaving. 
C - Single-character correcting codes modified by interleaving. 


Although the number of shift register stages is not an accurate meas- 
ure of decoder cost, it is seen that, except for the rate of 14, Hagel- 
barger’s codes generally cost more, especially at higer rates. Although 


*In our case the constraint length is equal to the storage requirement. 
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the interleaved single character correcting codes are slightly better 
than Berlekamp’s code for correcting type B1 bursts, they are both the 
same (and optimal) for type B2 burst correction.? 


VI. GENERALIZATION 


In order to demonstrate the minimum distance of codes with a 
longer constraint length we rely on the following lemma, 


Lemma: If the code with N = N, = 1 has a minimum distance of at 
least d, , then the code with N2 = N, + d, — 1 has a minimum distance of 
at least d; + 1. 


The proof of this lhomma depends on showing that no code word with 
d, or less nonzero elements can satisfy Ax = 0, which is equivalent to 
showing that any d, columns of A form a matrix of rank at least d, 
and thus equation (2) can be satisfied only by x = 0 among all x with 
weight d, or less. 


Proof: By the assumption that the code with N = N, has minimum 
distance d, , there must be at least d, nonzero elements in the first Nb 
symbols of x. Assume that there are just d, and no more, as well as no 
additional nonzero symbols in the rest of the constraint length Nz . 
We write x’ as a d, dimensional vector consisting of only the nonzero 
elements of x. Accordingly A’x’ = 0, where A’ is a N, by d, matrix with 
columns corresponding to the elements of x’. We must choose d, rows of 
A’ which have a nonzero determinant. Assume for the moment that 
none of error locations are zero. We then choose the bottom d, of A’. 
We have a d, by d, array of nonzero elements. Each of the matrix 
element is guaranteed to be nonzero by the fact that all the nonzero 
elements of x lie within the first N,b elements and consequently A’ can 
only have no zeros in the d,th or lower rows. 

We now take the d,; by d, array and divide each column by the first 
element. The first row is now all 1’s and the kth row is 


BiG ae k — 1) — Bisea¥i 
B;; Bivi 
= Cams wae cao ae (5) 


We next divide each row by al (#4) (4) (6-41)/21 thus obtaining a Van- 
der-Monde matrix. The determinant of the matrix cannot be zero since 
the quantity 


af ODay (6) 


736 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


is unique for each column. If q of the locators are zero, we choose the 
bottom d, — gq rows and those q rows which correspond to the 1’s in the 
q zero locator columns. This determinant is nonzero by the same logic. 
Consequently, the equation A’x’ = 0, can only be satisfied by x’ = 0, 
which contradicts our assumption. Therefore, the assumption is untrue 
and the minimum weight code word which satisfies (2) must have 
weight d, -++ 1 or larger, therefore providing the lemma. 

We now prove the general result by induction. When N = 1 the 
matrix 1 obviously has rank 1 and thus the minimum distance is 2. 
To obtain a minimum distance of 3 we need to add 2 — 1 = 1 to B, 
thus N = 2. Each time we increase the minimum distance by 1 we 
increase NV by d — 1, thus 


d-2 

N=1+ Di= G72) 1 
a=1 

2N — 4 = d(d — 8) for any value of d = 2. Since this is a lower bound 

on d we can be assured that for any d = 2 we can build a code with 

2N — 4S did — 8). 

The field must be of sufficiently large size so that all the “locators” 
of (6) be different. This can be done if the field has at least (b — 1) 
[(d — 2) (d — 3) + 2]/2 nonzero elements. Thus for example we 
could build a code which corrects two errors (d = 5), has a rate of 
3% (b = 4) with a 16 element alphabet. The constraint length would 
be Nb = 7 X 4 = 28. This could be implemented by using 4-tuples as 
symbols with an overall constraint of 112 binary digits. 


VII. ALTERNATIVE CODE 


It is also possible to obtain a minimum distance of N + 1 if one uses 
only a certain set of symbols as locators. In the previous section one 
needed a field with at least (b — 1) [(d — 2) (d — 3) + 2]/2 nonzero 
symbols. In this section we show that N can be made equal to (d — 1) 
if one is willing to use symbols from a much larger field. For this pur- 


pose we define B by: 
1 re 1 
1 2 b 
1 1 2 ae, Te Yo 
B=| 67: Bre °°: By, 


rea ens sak Ba 
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j-2 a on 

i=1 2 
where y; is the locator and @ will be defined later. As in Section VI 
we wish to show that 


Ax = 0 (7) 


only if x has d or more nonzero elements, given that there is at least 
one nonzero element among the first b. Suppose there exists a code word 
x with weight d — 1 or less such that (7) is met. Then take the N by 
d — 1 matrix consisting of the d — 1 columns of A corresponding to the 
nonzero components of x, and call it A’. The determinant of A’ is a 
polynomial in 8, where the lowest power of 6 is no lower than that found 
along the main diagonal. This comes about because the right-hand 
columns are shifted down and thus contribute least to the power of 8 
in the bottom rows. The highest power of 8 is shown in the appendix to 
differ from the lowest power by no more than: 


hell 


If we call the difference between the highest and lowest power of 8, 7, 
then the determinant can only be zero when @ is a root of an rth order or 
smaller polynomial with coefficients from the field containing the y,’s. 
Consequently we can choose 8 from an r + 1 order extension field such 
that it is a root of an irreducible r + 1 order polynomial. Then it cannot 
be the root of a polynomial of degree 7 or less, and the determinant 
cannot be zero. Consequently x = 0 for (7) to be met. 

We have now defined a as a member of the field with g’*? elements 
where GF(q) is the locator field. In other words our symbols are 
taken from a q™*! element field and the locators a; are confined to the 
q element subfield. For example, suppose we require a double-error- 
correcting code (N = 4) with a rate 34. We use the four element field 
as locators, and since r + 1 = 4 the symbols must come from a 4* = 
256 element field. This results in an overall constraint length of 
8x 4x 4= 128 bits. 





VIII. IMPLEMENTATION 


- The encoding can be accomplished by the standard convolutional 
encoder. The decoding presents the same problems as the decoding of 


* [+] is the symbol for the integer part of -. 
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a nonbinary Bose-Chaudhuri-Hocquenghem code. One takes increas- 
ing estimates of the number of errors within a constraint length and 
solves for the error locations and values. If the estimate is incorrect 
there is no consistent solution for the error values. The problem of 
error propagation can be easily handled by introducing a second set 
of syndrome calculators which are not changed when errors are cor- 
rected. A guard space equal to the constraint length, with no errors, 
will then produce zeros in all these syndromes, indicating that there 
have been no errors in the last constraint length. 

In both coding and decoding, symbols must be added and multi- 
plied. This is easy to implement when the symbols are represented as 
binary k tuples. Addition is just bit by bit modulo two addition; 
multiplication by a (the primitive element) is accomplished by a 
linear shift register.* 


IX. CONCLUSIONS 


We have presented a class of algebraic convolutional codes. These 
codes can be compared to other convolutional codes, such as Robinson 
and Bernstein’s self-orthogonal codes.* The Robinson—Bernstein codes 
are binary error correcting codes which have a constraint length bounded 
by N = [(2# — #)(6 — 1) + 1b. Our codes have a constraint length 
bounded by N S (2t? — t + 1)b, which is less than the above for 
b = 2. Ignoring the fact that our code is a character error correcting 
code, our code is more powerful than the Robinson—Bernstein code. 

If one takes account of the fact that characters are several bits long, 
one can still make a comparison by taking the constraint length of our 
code as 


N Ss (20° — t+ 1)b({ nl(b — 1) — 3¢ + 1) + O)} + 1), 
[-] integer of. 


One now finds that our code is more powerful than the Robinson- 
Bernstein code for sufficiently large b. More specifically, if ? < 2°/16b 
our code is more powerful. The Robinson-Bernstein codes have the 
advantage that they can be decoded by majority logic devices while we 
require much more complex devices. 

The codes described in this section may be useful when one desires 
a high rate code. Here the amount of storage needed at the encoder 
can be made equal to the redundancy per constraint length. The 
decoder must store the entire constraint length in order to make cor- 


CONVOLUTIONAL CODES 739 


rections but the part of storage needed to compute the error locations 
and values is only equal to the redundancy. The disadvantage is that 
the decoder must perform finite field multiplication, division, and 
addition. 

The code is also capable of correcting erasures; decoding is much 
simpler in this case. Therefore it may be advantageous to let the 
k tuples (which represent symbols) be code words in a binary sub- 
code. The subcode then could be used to detect binary errors (produc- 
ing an erasure), and the convolutional code used to correct these 
erasures. This 1s a convolutional version of Forney’s concatenated 
block codes.° 

The decoding algorithm is also simple for any code of distance four 
or less. This includes the single-error correcting code, the double- 
erasure correcting code, and the single-error-plus-single-erasure code. 
The decoding complexity is comparable to that of Hamming codes. 


APPENDIX 


Bound on the Powers of B 


We wish to bound the difference between the maximum and minimum 
power of # in the determinant of A’. The matrix will be made up of 
columns taken from B or columns from B shifted downward. This 
shifting downward of columns is the only parameter which effects our 
bound. Therefore we take Z; as the amount that the zth column is 
shifted; the 7th column is headed by Z; zeros. By ordering we make Z; a 
nondecreasing function of 7. We can assume that Z; < 7; otherwise this 
would put a zero on the main diagonal and produce a zero determinant. 
One could then take the largest nonsingular upper left minor A” and 
write A’ X"’ = 0. The only difference here is that A’’ has smaller size; 
but the bound will still be valid. 

The determinant of the matrix is given by 


2 U bi .6t4) (8) 


where oc is a permutation of the numbers 1 to N. The term },,,;;) is 
zero if a(t) S Z; ; otherwise it contains 8 to the {[o(7) — Z; — 2] 
-[o(7) — Z; — 1]}/2 power. For any given o the product of (8) contains 
B to the 


N 


5 2 [o(@) — Z; — 2][o(%) — Z; — 1] (9) 
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power, provided that for all 7 o(t) > Z;. Equation (9) can be re- 
written 
N 


4 Di oi) — 804) — 2H)Z, + Zs + 22; + J). (10) 
t=1 
The only term which depends on o is 


— > o(AZ; . (11) 


t=1 

The minimum power of £ in (8) is given by (10) when we minimize 
(11). Since Z; is nondecreasing the term which minimizes (11) is 
clearly o(z) = 7. Furthermore, the only other terms of (8) which con- 
tribute to this lowest power of 8 come from o’s which permute the 7’s 
over regions where Z; is constant. 

The difference between the maximum power of 8 and the minimum 
power is 


N N 
D= d%071Z,—-— Min >> o(dZ; 


i=] g,0(t)>Z4 i=1 


Max 2) [i — o(@)]Z, = > (1 — omin(t))Z; - (12) 


o,c(i)>Zi t=1 

The minimization in (12) is accomplished by assigning the smallest 7 
to the largest Z;, subject to the constraint that a(t) > Z;. This can be 
done by starting with Zy and assigning the smallest possible integer 
to o(N), namely Zy + 1. The term Zy_; then receives the next small- 
est integer which is still free, and so on. This is the minimum as any 
other acceptable o can be converted into this omin by a sequence of 
pairwise permutations which do not increase the sum. In Fig. 2 we 
show an example of the assignment on a square array where the 0’s 
indicate the o which minimizes the sum and the z’s that which maxi- 
mize it [o(t) = 2]. The 0’s in Fig. 2 are not exactly as described above 
but are permuted somewhat above equal value Z,’s for reasons which 
will be clarified in this appendix. 

We now show that we can increase various Z,’s to Z/’s so that Ai = 
Z14, — Z S 1 (in Fig. 2 we increase Z, , Z; and Zs) and D will not de- 
crease. When we change the Z; , omin(z) does not change. When A; 2 1, 
our construction of min(Z) gives omin(4) = Ziar < 2+ 1; thus omia(2) S 4, 
and t — omin(t) 2 0. Once we have the new Z/ with Z/,, — Z/ S l we 
write D’ = D in terms of a sequence 


I 


a= in — = 
1 
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Fig. 2— Bounds on the minimum and maximum powers of 8 in the determinant 
A. 


of 
Since 
i-l1 
Zi = DAs, 
721 
then 


i-1 


Di = > [i — o(2)]Z = > fi -— @] > At 


N 


DA! DY fo] 


7=1 w=7+1 


= Saf WoW titD_ al ag 


j= 2 i=j4+1 


We observe that our choice of «(2) gives 


> (i) : masa roe [N ee q[X—i+8 + z,| ; 


i=74+1 i=Zj+2 
when Aj = 1, 


and (18) becomes 


DAW — G-1- 2). 


For each value of Z;(0, 1, 2, ---) there can be only one nonzero AJ ; if 
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one takes K to be the number of nonzero A} , then 


Di = DW ff - 1-8, 


where f, is that value of j for which Z; = k and Aj = 1. We now show 
that 


YW = Wh-1- 8 s YW = dh — Dv. 


~=Jo 


The terms on the right are all nonnegative and for every & on the left 
there is an 7 on the right with 7 = f, . For that term f, S fo + &, asf, 
cannot increase faster than k. 


f,-1-—-ksf,—1, 
(NV _ fe) fe eet ky s (NV aa fi fo a 1). 


D's SW - df - 1 =(- 1) Ao BA 


t=fo 


IA 


(14) 


The integer value of fp which maximizes this is 


fo = | +1, [-] = integer part of- 


re ET]—}  ow 


This bound on D can actually be achieved for any N. 


and 
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Error Rate Considerations for Coherent 
Phase-Shift Keyed Systems with 


Co-Channel Interference 


By V. K. PRABHU 
(Manuscript received July 23, 1968) 


In this paper we present a theoretical analysis of the performance of an 
m-phase coherent phase-shift keyed system in the presence of random 
gaussian noise and interference. An explicit expression is given for the 
probability of error of the phase angle of the received signal; we show that 
this probability of error can be expressed as a converging power series. 
We show that the coefficients of this series are expressible in terms of well- 
known and well-tabulated functions, and we give methods of evaluating 
the character error rates of the systems. We also show that this error rate 
1s minimum when all the interference power ts concentrated in a single 
interferer, and that it attains its maximum [Pmlmax when the total inter- 
ference power ts equally distributed amongst the K interferers. The limit- 
ing case when Ki goes to infinity is considered. The cases of K = 1, and 
m = 2, 4, 8, and 16 are treated in some detail, and the results are given 
graphically. The usefulness of the results presented in this paper is that 
the designer can have at his disposal very simple expressions with which to 
evaluate the performance of any given Coherent Phase-Shift Keyed sys- 
tem when the received signal is corrupted by both interference and random 
gaussian noise. 


I. INTRODUCTION 


The performance of coherent phase-shift keyed (CPSK) systems 
has been investigated by many authors;?* in the transmission of in- 
formation the CPSK system has been shown to be one of the most 
efficient techniques for trading bandwidth for signal-to-noise ratio. 
However, the type of noise considered by these authors is almost 
always limited to be random gaussian noise although most authors 
admit that interference other than normal noise must be considered 
in the design of any modulation scheme for digital transmission. 
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Consider the following situation. In the frequency bands above 10 
GHz where the signal attenuation resulting from rain storms could 
be very severe, close spacings of the repeaters are almost always 
mandatory for reliable communication from point to point and for 
all periods of time.* In such cases the problem of interference may be 
much more important than the problem of noise in the optimum detec- 
tion of the desired signal; hence it is very desirable to evaluate the per- 
formance of a CPSK system with co-channel and adjacent channel in- 
terference so that, for the selection of an optimum transmission scheme, 
comparative advantages of CPSK over other broadband modulation 
techniques (like FM) in combating interference can be determined. 

We consider in this paper the performance of a CPSK system when 
the received signal is corrupted by both interference and random 
gaussian noise.* We first discuss binary (2-phase) and quaternary 
(4-phase) CPSK systems and show that exact expressions can be 
obtained for their probability of error P,,. These expressions are in 
the form of infinite power series which are shown to converge for all 
values of signal-to-noise ratio and for all signal-to-interference ratios 
above a certain level determined by the system. For m = 2 and 4, 
these error rates are calculated and the results are given in graphical 
form. 


For m = 3 and for m > 4 we show that exact expressions for P,, 
are very complicated functions of signal-to-noise ratio, and signal-to- 
interference ratios; in this paper we only indicate how these expressions 
can be obtained. However, we do obtain expressions for upper and lower 
bounds to P,, and show that the difference between these two bounds is 
a monotonically decreasing function of signal-to-noise ratio, signal-to- 
interference ratios, and the number m of phases used in the system. 
For m = 4, signal-to-noise ratio p’ = 5 dB, and for signal-to-interference 
ratio 1/L’? = 20 dB, we show that this difference is less than 5 percent, 
and that the upper bound can be used as a good approximation to Pn. 
For m = 8 and 16, we calculate these upper bounds and we present the 
results graphically. 

For a given amount of interference power, we show that the char- 
acter error rate is minimum when all the power is concentrated in 
a single interferer. If the total number of interferers is K we also 
show that the error rate P,, reaches its maximum [P,]max when the 
interference power is equally distributed among all the interferers. It 
~ * The word “noise” indicates random gaussian noise corrupting the desired 


received signal. ; f 
t We use the notation b = a dB if 10 logub = a. 
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follows that [Pm]max is a monotonically increasing function of K and 
attains its maximum when K goes to infinity. We show that the case 
of K going to infinity can be treated in a simple manner. 

For the computation of error rates P,, (or upper bounds to P,,, m 
> 2) it is necessary to calculate the central moments pe,’s of a certain 
random variable y defined in terms of the K interfering carriers. For 
large values of K the conventional method of evaluating pe,’s can be 
rather tedious; we give some simple methods of evaluating these mo- 
ments. 

In conclusion, this paper determines the performance of m-phase 
CPSK systems for the important case of signals corrupted by random 
gaussian noise and interference. The cases of m = 2, 4, 8, and 16 are 
treated in some detail. 


II. PHASE ANGLE DISTRIBUTION IN CPSK SYSTEMS 


Let us consider an m-phase CPSK system. We assume that there 
is a steady received signal* which is corrupted by random gaussian 
noise and interference. The gaussian noise is assumed to have zero 
mean and variance o*?. The signals under consideration consist of 
phase-modulation pulses of specified width transmitted at a known 
repetition rate; we assume that there are K interferers, each interferer 
having the same form as the signal. 

If we assume that each signal transmitted has a duration 7’, the 
received signal waveform in the absence of noise during the Nth in- 
terval can be represented as 


sy(é) = (28) cos (wot + 0), NT StS (N + 1)T, (1) 


where S is the received signal power, w, is the angular frequency of the 
signal, and @ will have some value in the discrete set 27k/m,O Sk S 
m — 1, corresponding to the Nth message. All m messages are assumed 
to be equally likely. In the absence of noise and interference, the set of 
m possible received signals is described by a set of m equally-spaced 
vectors in the complex plane as shown in Fig. 1. The noise and inter- 
ference corrupting the signal distort the signal both in amplitude and in 
phase; a zero-phase signal (corresponding to k = 0), as disturbed by 
noise and interference, is also shown in Fig. 1. 

If we now assume that power in the jth interferer is J;, the jth inter- 


*In this paper we do not consider the effects of fading on the error rates of 
CPSK systems. The effects of fading can usually be accounted for by a further 
integration of error rates obtained in this paper.* 
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Fig. 1 — Phasor representation of CPSK signals for m = 4. 


ferer as received during the Nth interval can be represented as* 
iv (t) = (21,)' cos {wt +0; +uj},NTSts(N+)T ~~ (2) 
where w, is the angular frequency of the jth interferer, 0; 1s some value 


in the discrete set (2x/m) k, 0 S$ k S$ m —1, and the probability density 
Tu;(u;) Of ws is given by 


1 
— <= y. 

Maye Sah see (3) 
0, otherwise. 


Since the K interferers are assumed to originate from K different 
sources, it is reasonable to assume that all p,;’s are statistically in- 
dependent of each other and are also independent of gaussian noise 
n(t). : 

The total received signal during the Nth interval can then be writ- 
ten as 


ry(t) = (28)* cos (wot + 6) + >» (21,)* cos (jt + 6; + w;) + n(d), 


ATStsWW+)T 
where n(é) has zero mean and variance o?. 

Assuming that the receiver used in the system detects only the 
phase angle ® of ry(t) and does not respond to its amplitude varia- 
tions,j we can write® 

* We assume that all t;y’s, 1 S 7 S K, are in the passband of the CPSK 
receiver used in the system. ; ve 

'+ This can be achieved in practice by using an ideal limiter at the front end 


of the receiver. If A(t)e?*™ is the input to an ideal limiter, its output is given 
by Ave?*“? where Ao is a constant. 
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ee ee 
: oe — aot (5) 
where f(t) is the Hilbert transform of ry(t) and is given by 


r ». if T(r) 
fy() = ye ae dr. (6) 
Let us write 


n(t) = I, cos (wot + 6) — I, sin (wot + 8). (7) 


We can show® that J, and J, are two independent gaussian random 
variables each distributed with mean zero and variance o7.* From 
(4)—(7) , we can now show that 








&’= 9+ tan’ 
K 
I, + 2) (21;)* sin [(@; — w)t + 6; — 9 + #5] 
SSS SSS (8) 
(28)! + I, + > (21,)* cos [@; — wo)t + 0; — @+ ui] 
Let us now write 
aes 
p= c ? (9) 
Pe 4. 
(28)? = 0, (10) 
Fee's 
6= 2) R; sin), , (12) 
EK 
7 = DD R; Cos A; ) (13) 
where 
3 fhy¥ 
Ri; = (4) ? (14) 
and 
d; = (w; = Wo)t + 0; — 0 + oy; . (15) 


*It is assumed that the spectrum of gaussian noise is symmetrical around the 
frequency w = wo. 
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Let us also denote the set {A, , Ay, --- , Ay, °° , Ax} of random 
variables \,’s by 2. 
We can now write eq. (8) as 


v+6 


bm = 6 tan”? ——_—_—— 
ee Gang 


(16) 


where 6 and y are functions of }. 
If K is a finite number, we can show’? that the probability density 
Py(n) can be represented as* 


1 is) . K 
pn) = 5- fe TI Jor) at, (17) 
T Ja 7=1 


where Jo(x) is the Bessel function of the first kind and of order zero. 
For K = 1, we can show that? 


oe eee 
pi(n) = 4 (Ry — 7°)?’ 


0, otherwise. 


ln| SR, (18) 


For K = 2, p,(y) can be expressed in terms of elliptic functions, and 
for K > 2, no closed form expressions can be obtained for p,(y). In 
Ref. 10 p,(y) has been expressed as a converging sum and has been 
evaluated for K = 10. It is easy to show that 


K 
pn) =0, for || > dR; , (19) 
and 
i) . K 
[pave an = TE Jo(tR). (20) 


III. CPSK RECEIVER 


An ideal CPSK receiver is shown in Fig. 2. The ideal limiter re- 
moves all the amplitude variations of the received signal before it 
reaches the ideal phase detector of the system. We shall assume maxi- 
mum likelihood detection for our analysis of the receiver. Let us as- 
sume that the receiver shown in Fig. 2 has zero-width decision thres- 
holds as shown in Fig. 1. 


* We can also write similar expressions for ps(6). 
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Fig. 2— CPSK receiver. 
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3.1 Error Rates for Binary CPSK Systems 


For a binary CPSK system the set of two possible received signals 
in the absence of noise and interference is shown in Fig. 3. The noise 
and interference corrupting the desired signal distort the signal both 
in amplitude and in phase; a zero-phase signal (corresponding to 
k = 0) as disturbed by noise and interference is also shown in Fig. 3. 

When the message k = 0 is sent, and when the phase angle & of the 
received signal lies in the second and third quadrants of the complex 
plane shown in Fig. 3, an error is made in detecting the received signal. 
For a given p’, and for an arbitrary set of \,’s let us assume that the 
origin of the gaussian noise vector is at point G in Fig. 3. When the 
terminus or tip of the gaussian noise vector lies in the left half of 
the complex plane (the shaded portion of Fig. 3) an error is made by the 
receiver. Since I, and J, are two independent gaussian random vari- 
ables and since they are distributed independently of ,’s, the prob- 
ability P.(\) that the terminus of the gaussian noise vector lies in the 


QUADRATURE 
—~ AXIS J 
NOISE ~ a Ig 
\\ ie we JAG le - 
| 


~L 
V2T [INTERFERENCE 
| 


_ Fig. 3— Phasor representation of CPSK signals for m = 2. I- and J, are the 
in-phase and quadrature components of gaussian noise corrupting the desired 
received signal. 
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left half of the complex plane is given by* 
PA) = Pr[—«o <I, < ~] 


Pr | -« ae -(2s) 4 > 1) cos ») | 


1 ~{ ante Ear cos \ 2 
= ae a exp (—£/20%) dt. (21) 
We can show from Equation (21) that 
P2(d) = % erfe [o + py], (22) 
where 
erf (x) = = ) exp (—u”) du (23) 
Ww vo 
and 
erfe (x) = 1 — erf (2). (24) 


The character error rate P. for a binary CPSK system is, therefore, 
given by 


P, = E[P,(d)], (25) 


where E[P.(A)] represents the mathematical expectation of the random 
function P(A). 
From Equations (22) and (25) we have 


P, = 3£Eferfc {p + p7n}]. (26) 
We now note that we can write! 1? 
i) rd 
erfe [x + z] = erfe [x] + 4 exp (—2”) > (—1)'He-a(e) F : (27) 
t=1 : 


where H,,(x) represents the Hermite polynomial of order n. The series 
converges for all values of x + z such that 


zt2e20. (28) 
From Equations (26) and (27) we have 


Py = $erfe (6) + “exp (2) Do (-D'Haalo) FBG). 29) 


* The notation Pr[a < x < b] denotes the probability that the random vari- 
able x satisfies the inequality a < « < b. It may also be noted that P2(A) is a 
conditional probability conditioned on A. ~ 
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Let us denote by p, the nth central moment of 7.* It can then be 
shown that?° 


Moevr. = 0, £=0,1,2,-:-. (30) 
We, therefore, have 
Py = $ erfe (0) +75 exp (— a!) So Haeale) Pasa G1) 
2 2 3? — 2f-1 (2£)! 2he 


The series given in Equation (31) converges for all values of p and 
R,’s such that 


p+py20 for all . (82) 


From Equations (18) and (32) we can show that the series converges 
when 


Q 


IIA 


1, (33) 
where 


o= SR. (34) 


i=1 
Equation (84) states that the sum of the normalized amplitudes of all 
the interfering carriers may not exceed the normalized amplitude of 
the desired signal. This is not a very stringent requirement and it is 
almost always satisfied when low error rates are desired. 
Since we also know that* 


1 K K 
i > p, = UR", (35) 
7=1 7=1 


when Equation (383) is satisfied, we have 


fi) <()" on 


The expression S/I; denotes the signal-to-interference ratio of the 
jth interfering carrier. 
When there is only one interfering carrier we can show that, 


2)! 
bot = ee TE ’ (37) 


_ *Equation (85) states that the arithmetic mean of a set of real variables 
is always greater than or equal to its geometric mean. 
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a as 


P, = §% erfe (p) + *y exp (—p) x aoa Tikal Ley 


The series in equation (38) converges for all signal-to-interference 
ratios such that 


and equation (31) can be written as 


PS S-1; (39) 


The values of Ps have been calculated from equation (38) and the 
results are given in graphical form in Fig. 4.* 

Notice that we need to calculate only the even order moments 
fon’S Of the random variable 7 in determining Pz from equation (81). 
Some methods of calculating these moments are given in Appendix A. 


3.2 Error Rates for Quaternary CPSK Systems 

Let us now consider a 4-phase CPSK system. For this system the 
set of four possible signal phasors and the four optimum decision 
thresholds are shown in Fig. 5. A signal phasor (corresponding to 
k = 1) as disturbed by noise and interference is also shown in Fig. 5. 

For a given set of ,’s let us assume that the gaussian noise is rep- 
resented by a vector from the point G. If the message k = 1 is trans- 
mitted, an error is made if the received phase angle lies in areas 
marked 1, 2, and 3. The phase angle of the received signal will lie 
in areas marked 1, 2, and 3 if the terminus of the gaussian noise 
vector lies in this area of the plane.'* 

We notice that 


K 
= (28) sin7 + S (2I,)* sin (= = as) ) (40) 
and 
= (25)! sin 7 + d @I,)} cos (E+ Mi): (41) 


Let us denote by Tn1,x2 » °** 5 en(A) the probability that the terminus 
of the gaussian noise vector lies in area 


kit 
i=l 
* The results obtained in Fig. 4 indicate that the error rates obtained in Refs. 
13, 14, and 15 agree well with those obtained in this paper. 
'tThe notation U 714; denotes the union of all elements of the set {k1 ke ,+++,kn}. 
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Fig. 4— Error rates for a 2-phase CPSK system with one interferer. 


We can show from Fig. 5 that 


II, (4) = 4 erfe |» sin 7 + p 3 R; cos +; i: (42) 


fIa> 


TIz,3(4) = 4 erfe |» sin 7 + p > R, sin ( —~ +; if (43) 
j=1 


tS 


and 
11,() = 4 erfe |» sin 7 +p a ce (# + ») 


-erfe | psn + p > R, sin (# +n) (44) 
The probability P,(4) of an error due to noise is, therefore, given by 


P,(d) io TI; ,2(d) i TIe,3(d) = TI2(d). (45) 
The probability of an error due to noise and interference is therefore 
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Fig. 5—- Phasor representation of CPSK signals for m = 4. Iu and I, are two 
orthogonal components of gaussian noise. 


given by 


Py= E[P,(Q)]. (46) 
From equations (27), and (42) through (46) we can show that 


P, = erfe E sin | —4 erfc? E Rie «| 


4. 


ct oy exp (—¢ sin” ae — erfe |» sin Al 


4 


i) . 2¢ 1 : 
; dX Basal p sin r) ay Mag — fe exp —2p" sin” 7) 
=i : 


4 
C-) cs Haea(p sin TV Hai(p sin *) 
2¢f+7) 
° > > (26)! (23)! p Mae,2; (47) 
where py, 2;’8 are given by 


1 2r 2a 27 K 2¢ 
M3t,23 = —./ dé, i db, +++ [ die, R; cos s} 
(2r)” Yo 0 0 i=1 
K 25 , 
ae OE AD Resin 0) oe 


ful 


48) 
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For a given set of R,’s, ux,.;’8 may be evaluated from equation (48). 
For K = 1, we can show that 


= Rrire (26)! Qs)! : 
2 st (e +s)! 
For K = 1, we have calculated P, from equation (47) and the 
results are presented in Fig. 6. 
We can again show that the series given in equation (47) con- 
verges for all values of p and R,’s such that 


(49) 


* 
M2t,2s8 


: 1 
< in x = —_. 
ss 4 VQ (50) 
For K = 1, equation (50) becomes 
S/I = 2. (51) 


Equation (50) is usually satisfied by systems encountered in practice. 
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Fig. 6 — Error rates for a 4-phase CPSK system with one interferer. 
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3.3 Error Rates for Multilevel CPSK Systems 


In this section we shall investigate the performance of a multilevel 
(m = 3) CPSK Systems and indicate a method in which an exact 
expression can be obtained for the probability of error of the system. 
This exact expression is a very complicated function of signal-to-noise 
ratio and R,’s; we do not obtain this expression in this paper. However, 
we obtain upper and lower bounds to P,, and show that the difference 
between these two bounds is a monotonically decreasing function of 
p, m, and signal-to-interference ratios. For K =1, m = 4, p’ = 5 dB, and 
S/I = 20 dB, we show that this difference is less than 5 percent of 
the lower bound, and hence the upper bound is a good approximation 
to P,, when low error rates are desired. 

A signal phasor corresponding to k = 0 as disturbed by noise and 
interference is shown in Fig. 7. For a given set of ,’s let us again assume 
that random gaussian noise is represented by a vector from the point 
G shown in Fig. 7. If the message k = 0 is transmitted, an error is 
made if the terminus of the noise vector lies in areas marked 1, 2, and 3. 

We can show that * 


K 
II, ,.(4) = 4 erfe E sin = + p 2, Re sin (= — ») | (52) 


and 


K 
erfc |» sin = +p > R,sin (= + ») | (53) 


TI2,3(A) re m 
j=1 


vie 


The probability of error due to noise is, therefore, given by 


P,(A) = Th,2Q) + Te,s(4) — Ie(A). (54) 


By looking at Fig. 7 we can see that no simple expression can be 
obtained for II,(A) (except when m = 4). II,(A) denotes the probability 
that the terminus of the gaussian noise vector lies in area 2; we shall 
now obtain upper and lower bounds to II,(A). Assume that 


K 
psin — p )/ R; 20 (55) 


j=1 
* Note that 


Tv 


K 
GA = (28)! sin + > (27,)? sin (= = i) 


and 


OB = 28) sin® + ¥ @r)'sin (= +») 


Ee 
m 
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_ Fig. 7— Phasor representation of CPSK signals for m = 8. I, and I» are the 
in-phase and quadrature components of gaussian noise. 


so that II,,.(A) and II,,3(4) are nonnegative for all values of \. If equa- 
tion (55) is satisfied, it is easy to see (see Fig. 7) that 


TI,(A) = 0 for all d, (56) 


and II,(\) reaches its maximum when* 
K 
n= — 2, Ry = —2@. (57) 
F t=1 
For this value of 7 it can be shown (see Fig. 8) that 


1 —Yo —(yt+yo) tan r/m 
ae / exp (—y’/20”) dy / exp (—2"/20") dx 
es 0 


II,(d) = 
or 
1 ie) 5 i (y-vo) tan r/m r i 
TI,(4) = 3 exp (—y"/20°) dy exp (—2°/20°) dx (58) 
vo i?) 
where 
Yo = (28)*[1 — Q]. (59) 

Since we always have 

0 S exp (—27/20’) < 1 for all real z, (60) 


* We can show that » = —Q when all A,j’s are odd multiples of x. 
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Fig. 8 — Computation of lower bound to Pn. 


we have 


t eo 
a(x) = 2222 [Wy — y,) exp (—y'/20") dy. (61) 
Equation (61) can be simplified to* 


_ tana/m. 


T12(A) S Qno = exp [— p (1 — 0)”] 


-[1 — (wr) 4o(1 — Q) exp [p°(1 — 2)” erfe {p(1 — 9)}]. (62) 
From equations (54), (56), and (62) we have | 


TH,,2(4) + Ts) — Qno S Pm(d&) S Th2(d) + Ths). (68) 


Since 
Po =F P(N); (64) 
we can show from equations (52), (53), (55), and (63) that 
Qa — Qn = Pa = 0. (65) 
where 


° T 
Qn = erfe (, sin *) 


2 
—~yz exp 


+ @ 


T co) Has p sin =) 
(—- sin” 4 Dd Hr P Boe» (66) 


= 1 


* For large values of p and small values of 2, Qmo is approximately equal to 


tan r/m exp [—p'(1 — 9)’] 
2r p(l — 2) 
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The series given in equation (66) converges if equation (55) is 
satisfied, or if 


Oi TT 
Q < sin a (67) 


When low error rates are desired, equation (67) must be satisfied. 

Equation (65) gives an upper and a lower bound to P,,; as can be 
seen from equation (62) the difference Qmo between these two bounds 
is a monotonically decreasing function of p, m, and signal-to-inter- 
ference ratios. From equation (65) we have 


Q no PaO 

0 0s, < P. <0. (68) 
For K = 1, R, = 7, and for m = 4, 8, and 16, we have plotted in 
Fig. 9 Qno/(Qm — Qmo) aS a function of p?. From Fig. 9 we see that 
Qmo/(Qm — Qo) is less than 5 percent for p? = 5 dB and for m 2 4. 
We can, therefore, use Q,, as a good approximation to P,, for high values 
of signal-to-noise ratio (p? = 5 dB) and for high values of signal-to- 

interference ratio (1/R, = 10 dB). 


In these cases we then have 
P.,, & erfe (, sin *) 
m 


2 ( 2+ 2 =) = Has(p sin =) 2k 69 
Gai exp \—e" sin’ - ep? Hor. (69) 


k=l 





+ 


For K = 1, and for m = 8 and 16, the values of P,, obtained from 
equation (69) are given in Figs. 10 and 11. The error made in this 
approximation can be estimated from equation (68). 


IV. ERROR RATE AS A FUNCTION OF NUMBER OF INTERFERERS 


Let us now investigate how P,, varies as a function of K for a total 
given interference power. Let us assume that the total interference 
power is some number SL? where 


pia 
Do dy = SL’. : (70) 

f=1 
This power SL? can be distributed among the K interferers in a 
variety of ways; every one of these distributions will in general lead 
to a different character error rate of the system. Let us find out those 
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SIGNAL~ TO-NOISE RATIO IN DECIBELS 
Fig. 9—Qmo/(Qm — Qmo) as a function of p. 


distributions of power (if they exist) which make this character error 
rate a maximum or a minimum. 
4.1 Error Rates for K Interferers 


Let us first consider the case when p > 1 and 2 < 1. In this case the 
series corresponding to P,, (or Q,,) converges very rapidly; let us say 
that the first N terms of the series are sufficient to evaluate P,, to the 
desired degree of accuracy. 

For all ¢ and z, we have 


Fo:_,(@) = 22H o¢-2(z) — 2 (2€ — 2)Ho,_3(z). (71) 


From equation (71) we can show that 


Basal p sin =) = 0, lsjsN, (72) 
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if 


— 3 
hones ee (73) 


> 
IV 


- 
sin — 
m 


If Equations (72) and (73) are satisfied notice from Equations (31) 
and (69) that P,,’s are monotonically increasing functions of ys,’s, 
€ = 1. For a given p, , it can be shown from Equation (18) that y2,’s 
€ = 2, reach their minimum when Q is minimum and they reach their 
maximum when Q is maximum. 

We can then say that P,,’s (or Q,,’s) attain their minimum when Q 
is minimum and that they are at their maximum when 0 is maximum. 

From Figs. 3, 5, and 7 this seems to be true for all values of p and Q 
which satisfy Equation (55). 

Let us now find out when 2 is minimum for a given value of signal- 
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Fig. 10 — Error rates for an 8-phase CPSK system with one interferer. 
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Fig. 11 — Error rates for a 16-phase CPSK system with one interferer. 


to-interference ratio. The signal-to-interference ratio 1/L’ is given by 


= >a. (74) 
jai S 
Clearly © is minimum when 
SSL, LSys K, (75) 
and 


We can then say that the character error rate P,,, is minimum when the 
total interference power is concentrated in a single interferer. 
Now from equations (14), (84) and (74), 2 is a maximum when 


K 3 K 
FEW) --E@)-0 reise on 
7 Lj=1 7=1 
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e is a constant and is the Lagrange multiplier used in finding the 
extremum of 0. 

Solving equation (77) we observe that Q is a maximum or that 
P,, 18 & Maximum when 


SL’ 
K ? 
or that the total interference power is equally distributed among the 
X interferers. 
Let us now assume that K is a variable number. It is clear from 
equation (78) that [Pn]max 18 a monotonically increasing function of 
K. 


IIA 


K (78) 


4.2 Hrror Rates for a Large Number of Interferers* 


Let us now consider the limiting case when K goes to infinity and 
K 
Dy iS BL: (79) 
i=1 


We can show’® that the probability distribution function off 


ul) = Yo CL)! eos fait + 0 + as) (80) 


as K goes to infinity approaches that of gaussian noise with mean 
zero and variance SL? under certain conditions. 
In this case we have from equation (4) 


y(t) = (2S) cos (wot + 6) + y(t) + n(Q). (81) 
Since y(t) and n(t) are independent gaussian random variables their 
sum 
b@) = y¥® + n@ (82) 
is also a random gaussian variable with mean zero and variance SL? 
+ o. 
From equations (81) and (82) we can write 
ry(t) = (28)? cos (wot + 6) + bi) (83) 
where b(t) is a gaussian random variable. 
*The results of this section are applicable for any signal-to-noise ratio and 
any signal-to-interference ratio. 


+ Ruthroff has shown that for K 2 50 the distribution of y(t) can be considered 
to be gaussian in practice for the computation of distortion in PM systems.1¢ 
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The case where ry(t) can be described by equation (83) has been 
considered in detail in Ref. 17;* we can easily determine the deteriora- 
tion in performance produced by interference from the results pre- 
sented in that paper. For example, suppose that m = 4, S/o? = 16 dB, 
and L? = —16 dB. Clearly 


2 
tL = -134B (84) 
and P, from Ref. 17 is given by 
PE 79 X10", (85) 


For the calculation of the effect of interference in CPSK systems, 
we note that we have not shown the validity of the gaussian approxi- 
mation of y(t) for K > 1. However, this assumption seems to be 
justified for large signal-to-noise ratios and small interference-to- 
signal ratios.%® . 

In conclusion, this section gives methods of evaluating character 
error rates of CPSK systems for all values of m and for all values of 
K. It shows that the error rate P,, is minimum when all the inter- 
ference power is concentrated in a single interferer and that it attains 
its maximum value [P]max When the interference power is equally 
distributed amongst all the interferers. We further show that [Pm] max 
is a monotonically increasing function of the number K of interferers. 
We also show that the case, K going to infinity, can be treated and that 
the deterioration in performance produced by interference can be de- 
termined. 


Vv. CONCLUSIONS 


A method to evaluate the character error rates of CPSK systems 
has been presented in this paper. The received signal is assumed to be 
corrupted by both interference and random gaussian noise. When the 
number of interferers is very large it can be shown that the inter- 
ference and random gaussian noise can be combined together to give 
rise to an equivalent noise source having gaussian properties. The 
variance of this random variable is the sum of variance of random 
gaussian noise and total interference power. In this case the analysis 
of the CPSK system can be done by methods presented in Ref. 17. 


When K is a finite number and when m = 2 or 4, exact expressions 


* The results presented in this paper for S/IJ = © are also sufficient to deter- 
mine Pm for a large number of interferers, 
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are given for the probability of error P,,. When m = 3, upper and lower 
bounds to P,, are derived. We show that the difference between these 
two bounds is a monotonically decreasing function of signal-to-noise 
ratio p’, signal-to-interference ratio 1/L’, and the number m of phases 
used in the system. For K = 1, m = 4, p? = 5dB, and1/R, = 10 dB 
we show that this difference is less than 5 per cent, and that the upper 
bound can be used as a good approximation to P,, . 

We then show that for any m-phase CPSK system the character 
error rates can be expressed in terms of the central moments of a 
certain random variable y and that they can be calculated to any 
desired degree of accuracy by using a set of tables or by using a 
digital computer. 

For a total given interference power we show that the character 
error rate P,, attains its minimum when all the power is concentrated 
in a single interferer, and that it reaches its maximum [Py] max When 
the power is equally distributed among all the K interferers. It is also 
shown that [Pm]max is a monotonically increasing function of K. 

The cases of K = 1, m = 2, 4, 8, and 16, have been treated in some 
detail and the results are given in graphical form. The required signal- 
to-noise ratio for any value of signal-to-interference ratio can be 
determined from these figures. 

The usefulness of the presented results is that they provide the 
designer with some relatively simple expressions with which to eval- 
uate the performance of any given CPSK system with interference 
and random gaussian noise. The only quantities he must have at his 
disposal are the central moments of a certain random variable y 
defined in terms of the K interfering carriers. 
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APPENDIX 


Evaluation of Central Moments of 


In the computation of character error rates for CPSK systems it is 
necessary to calculate the even order moments of the random variable 
7; We shall give in this section two alternate methods to evaluate these 
moments. 


766 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


By definition p2, is given by 


1 2r 27 2a K 2n 
Han > 72 \K de, dbz Srene / ads} 3 R; cos | . (86) 
(27)” Yo 0 0° j=1 


By the multinomial theorem 


| 3.8, 00s 0, |” = ny! TF Ry” cos" 6, (87) 


i 
7=1 


where n,’s are positive integers such that 


K 
Yn; = Qn. (88) 
7=1 
Since 6@,’s are statistically independent of each other, and since 
Hors: = O for all £, we have from Equations (86) and (87)* 


~- 3 Fin TI eon (| (89) 


t=1 


where n,’s are. a set of even positive integers satisfying Equation (88). 
Even though equation (89) gives an exact expression to evaluate 
Pan's, it can-be rather tedious to’ evaluate pen’s from equation (89) 
for large values of n and K. We shall therefore give an alternate 
method to evaluate the central moments of the random variable 7. 
It can be shown that the probability density function p.(y) of the 
random variable 7 can be expressed as?° 


p(n) = bla +2 ¥ cos? I J (ee |. (90) 


The 2nth moment of 7 can be ao ae as 





bow = | pi) de. (91) 
-2 


From equations (90) and (91) we can show that 





— 2” +1 
Hon = & (et t2 de 1) 


{0 (82) | Sco gee) 


* For K = 1, equation (89) reduces to equation (37). 
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It can be seen that the infinite series appearing in equation (92) 


converges rapidly for all values of R,’s; we need take only a finite 
number of terms from equation (92) to estimate po,’s. It is, therefore, 
easier to evaluate po,’s from equation (92) than from equation (89) 
when there are a large number of interferers, and we have to take a 
large number of terms in estimating P». 
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Spectral Density Bounds of a PM Wave 


By V. K. PRABHU and H. Ek. ROWE 


(Manuscript received November 5, 1968) 


In this paper we derive upper and lower bounds of the spectrum of a 
sinusoidal carrier phase modulated by gaussian noise having a rectangular 
power spectrum. It has been found in practice that such a random process 
adequately simulates for some purposes, a frequency division multiplex 
signal, a composite speech signal, and so on. We show that these upper and 
lower bounds of the spectrum are very close to each other if the root mean 
square phase deviation of the carrier ts even moderately high. Also, a simple 
method called the saddle-point method can be used at all frequencies f to 
estimate the spectrum with less than ten percent error. We also show that 
the results obtained from the quasistatic approximation, often used in 
such cases, are too small for large f, and that this low-frequency approxi- 
mation cannot be used in cases where the behavior on the tatls 1s 
of tmportance. 


I. INTRODUCTION 


It has been found in practice that a bandlimited random gaussian 
noise having a rectangular power spectrum adequately simulates for 
some purposes a wideband composite speech signal, a frequency divi- 
sion multiplex baseband signal consisting of a group of single side- 
band carrier telephone channels, and so on.’ In the design of commu- 
nication systems, the spectral characteristics of a sinusoidal carrier 
phase modulated by such a baseband signal are of great interest; 
various methods have been used in recent years to estimate this 
spectrum for large and small values of mean square phase deviation 
of the wave, both close to and far from the carrier frequency (that is, 
in the principal part of the spectrum and far down on the tails of the 
spectrum respectively) 2-8 

It has been shown that the spectrum may be expanded as an 
infinite series of weighted convolution terms.?>7*8 This series may 
be used to estimate the principal part of the spectrum (close to the 
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carrier) for small or moderate index (that is, small or moderate values 
of rms phase deviation). However, for large index, or far down on 
the tails for small index, too many terms would have to be included 
if this series is to be used directly. 

The simplest analysis—often called the quasistatic approxima- 
tion—yields a gaussian spectrum for large-index angle-modulated 
waves” *® in most cases.* This approximation fails far out on the 
tails of the spectrum; a careful investigation has been given in only 
a few cases.” We obtain below upper and lower bounds for the spec- 
trum of an angle-modulated wave with white, band-limited phase 
modulation; far out on the tails the spectrum far exceeds that pre- 
dicted by the quasistatic approximation. 

This problem is of interest in considering interference between 
two (or more) phase modulation (PM) systems in neighboring loca- 
tions. Consider the following situation. In the frequency bands above 
10 GHz, where the signal attenuation due to rain storms could be very 
severe, close spacings of the repeaters are almost mandatory for 
reliable communication from point to point.® In such cases the prob- 
lem of interference between neighboring systems may be much more 
important than the problem of noise; the system may thus be in- 
terference limited. In order to combat this interference it is very 
likely that broadband modulation techniques like PM [or frequency 
modulation (FM) ] or pulse code modulation (PCM) will have to be 
used. In order to investigate the effect of this interference between two 
co-channel PM (or FM) waves it is necessary to evaluate the spec- 
trum of a PM wave, so that the parameters (such as rms phase 
deviations) of the two PM systems can be properly chosen to keep 
‘the interchannel interference below a certain desired level. 
| We first obtain an expression for the covariance function of the 
PM wave. From this covariance function we then derive an expres- 
sion for the spectrum of the PM wave and show that it can be 
expressed as an infinite series. This series has been evaluated for 
certain values of rms phase deviation N.° 

We then show that the autocorrelation function of the PM wave 
is analytic at all points in the finite part of the complex plane deter- 
mined by the argument of the autocorrelation function. In determin- 
ing the Fourier transform of the autocorrelation function we change 
the path of integrationf so that the contour is very close to the path 

* For exceptional cases see Ref. 7, Ch. 4, pp. 131-135. 


. ¥ The method used in this paper to evaluate the spectrum is a close relative 


of: the method of steepest descent (or the saddle-point method) used in evaluat- 
ing certain kinds of integrals.» * *% 
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of steepest descent of the integrand. We then divide this contour into 
four (or five) disjoint sections and show that the major contribution 
to the integral comes from one of these sections. 


We next derive upper and lower bounds to the spectrum S,(f) of the 
PM wave and show that these bounds are very close for all f and for all 
values of rms phase deviation N = 5. For N 2 (10)? we show that the 
spectrum can be evaluated by this saddle-point method in a very 
simple manner with a very small fractional error (less than 10 percent), 
and we give this method. 

We finally compare the quasistatic approximation to the saddle- 
point approximation. For large values of frequency f, we show that 
the quasistatic approximation gives too small a value for the spec- 
trum, and that it cannot be used in cases where the spectral behavior 
on the tails is of importance. However, as we show, the saddle-point 
method can be used in all cases in which N is moderately high. 

In conclusion, this paper gives a simple method of evaluating the 
spectrum of a sinusoidal carrier phase modulated by gaussian noise 
having a rectangular power spectrum and having a moderately high 
modulation index. 


II. SPECTRAL ANALYSIS OF PM WAVES WITH RANDOM PHASE MODULATION 


A sinusoidal wave of constant-amplitude phase modulated by a 
signal n(t) may be written as 


W(t) = A cos [wot + n(t) + 4], (1) 


where A is the amplitude of the wave, fo = w)/27 is the carrier fre- 
quency of the wave, and @ is a random variable with probability density* 


] 
does < 

19(8) = 2r ’ Pal ane (2) 
0, otherwise. 


Assume that n(t) is a stationary bandlimited white gaussian ran- 
dom process with mean zero and variance N?.+ Its spectral density 


*If n(t) is a stationary random process the introduction of random variable 
6 in equation (1) makes W(t) a random process which is at least wide-sense 
stationary so that its spectrum can be calculated from the Wiener-Khintchine 
theorem.” * If we do not have @ in equation (1), W(t) is no longer (even wide- 
sense) stationary, and the spectrum of W(t) is usually calculated from the time 
pre aa function of W(t)? The results obtained in the two cases are 
identica 

+ The parameter N represents the rms phase deviation (or modulation index) 
of the PM wave given in equation (1). 
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S,(f) can be represented (see Fig. 1) as 


S.(f) = fos. Naese (3) 
0, otherwise. 


Such a random process n(t), is often used to simulate a multiplex 
signal, a composite speech signal, and so on.? 5 

We can show from equation (3) that the covariance function R,,(r) 
of n(t) is given by 


2sin 27W7r | 


2nWr ’ (4) 


this function R,(r) is shown in Fig. 2. Since n(¢) is a stationary gaus- 
sian random process it can be shown that W(t) is at least wide-sense 
stationary and that its covariance function Ry(r) can be represented 
as? 7 


R,(7) = N 


Ry(s) = 4 exp [—Ry(0)] exp [Rx(7)] ¢08 wor. (5) 


From the Wiener-Khintchine theorem, and from equation (5), the 
spectrum Sj (f) of W(t) can be written as 


Sw(f) = [ Rw(s) exp [—i2efr] dr, ©) 
or 

Sw(f) = 4 (Sv — f) + Su + fl 7 
where 


Sri) = [exp [-Rx(0)] exp (Ru(0)] exp [-2efr] dr. 8) 


SPECTRAL DENSITY 
IN RAD2/ Hz 





W FREQUENCY IN Hz 


Fig. 1—Spectral density of phase modulation. 
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Fig. 2— Covariance function of n(t). Since Ra(r) is an even function of 7 we 
only show R#.(r) for r = 0 


From equations (4) and (8) we have 


Si) = sag [ex {-w"[1 ne) exp (—apldp, © 


where 


— a (10) 


III. SERIES EXPANSION OF SPECTRUM FOR GAUSSIAN MODULATION 


The integral in Equation (9) can be evaluated by expanding 


ao(-wfi 2) 


into a Taylor series; integrating term by term we can write* 


exp {-n'{1 — sng | = exp [—N’] D3 x (Be) (11) 


* We note that )°?~0 2*/n! converges uniformly to exp [z] for all finite values of z. 
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From equations (9) and (11) we have* 


Sy) = exp [-N"] 
{ap toto ETL (C2) ew -nal} a, 02 


where 8(f) is the Dirac delta (unit impulse) function. 
We now note that 








[22 exp [-i0l dp = F.0) = : sede hig 
Be 0, otherwise, 
or 
FAA) = xlui(A + 1) — uA — DI, (14) 
where u_;(x) is the unit step function defined by 
Pe | zx> 0, (15) 
0, t= 0, 
and that? 74 
a) id £ 
[- (22Y ex (il ap = #20) 
oe DS, ye (al te— 2h) = & 
ale 2D arg@—p OSD <4422 ae 


0, otherwise, 


where 


ie ine| +1] ; (17) 


and INT|z] represents the largest integer not greater than x. 

It can be shown that F,(\), ¢ 2 2 is a continuous function of \ and 
that F(A) is discontinuous at \ = 1. For large ¢, we can show from the 
central-limit theorem{ that” 


FAA) ~ (Sx) exp | =|. (18) 


* The term containing 4(f) in equation (12) represents the de component of Sy(f). 


t 
t See pp. 362-366 of Ref. 2. It can be shown that Gena can be interpreted 


as the characteristic function of the sum Q of £ independent random variables with 
identical uniform probability distributions.6 The function F,(A)/2zr_ therefore 
represents the probability density function of Q. Alternatively F(A) is the (€ — 1)- 
fold convolution of the flat spectrum with itself. 
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From equations (12), (18), and (16) we can write 


Sy) = exp [-W"I] a) + 5h SS raw)|- a9 


For N’? = 6, we have calculated the spectrum from equation (19) 
and the results are shown in Fig. 3. Notice that the spectral density is 
discontinuous at f/W = 1. 

For N’? « 1 (low-index modulation), we have from (19) 


SD & Sul) = ew (-W'] + A am], — eo 


and the error in this approximation may be investigated from equations 
(9) and (20).* The approximation given in equation (20) represents the 
low-index approximation for the spectrum; this result has been obtained 
by many authors.*~° 

The series given in equation (19) may be used to estimate the princi- 
pal part of the spectrum (close to the carrier) for small or moderate 
index, since only a small number of terms need to be included in the 
partial sum. However, for large N*, or far down on the tails of the 
spectrum for small N’, too many terms would have to be included to 
estimate the spectral density. In fact for N® >> 1, or for f/W > 1, 
the degree of complexity required in estimating S;(f) from equation 
(19) becomes inordinately high. 

When N’ > 1, and for low frequencies, several authors have given’— 
the quasistatic approximation ft . 


7 


Sy(h re exp (-N) at) + ie (2) exw | 2 (E) | en 
NW \2r 2N” \W. 
for the spectrum. The question arises whether equation (21) can 
be used for large f. Since Ry(r) is infinitely differentiable there is 
no simple way (known to the authors) of investigating, for large f, 
the error is this approximation.” 

In the problem of interference between two neighboring channels 
it is necessary to evaluate Sy(f) for large f so that the effect of this 
interference can be determined. As we shall show later on in this 
paper equation (21) gives too small values to Sy(f) for large f; it is 
therefore essential to have a simple and elegant method (different 
from the series method) to evaluate Sy(f) for large f and for large N?. 


* At times the low-index approximation for the spectrum is written as exp 
[—N]5(f) + (N?/2rW) Fi f/W). For N? «<1, exp [—N?] = 1. 
Note that mean square frequency deviation is N?W?/3. 
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Fig. 3— Spectral density of a PM wave for N? = 6. The discrete part of Sy(f) 


jy f = is not shown in this figure. Note the discontinuity in the spectrum at 


Readers who might be interested in the final results without wishing 
to work through the detailed analysis, might skip Section IV of this 
paper. 


IV. SPECTRUM EVALUATION BY CONTOUR INTEGRATION 


Let us now consider the integral given in equation (9). Since 8(f) 
and F,(f/W) are discontinuous functions of f, let us define an integral* 


SQ) = i exp [-N'Kex E sing | — (1 + N’ sin) cos Ap dp 


*In Ref. 14 this integral has been studied by Lewin for }\ = 0 and A = 1. It 


also occurs in several limiting cases in Ref. 1. It is sometimes referred to by 
the name Lewin’s integral? 
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or 


Sf) = iz exp [-N'Kexp & sng = (1 4. N? sing) 


-exp(—jdp) dp. — (22) 


Since all F,(f), £ = 2 are continuous, it can be shown from equations 
(19), and (22) that S(f) is a continuous function of f. 
From equations (9), (19), and (22) we can then write* 


So(f) = exp (wy, acy $+ 2 tual + W) = wall - wy} 


i 
+ ae Re Sf). (28) 


Notice from equation (22) that the integration is carried out along 
the real axis, and that for large | | (or | f |/W), the final factor of the 
integrand exp (—jAp) is a very rapidly oscillating function of p. From 
Refs. 7, 10-13 notice that in such circumstances the method of steep- 
est descent (or saddle-point method), or one of its close relatives, is 
often useful to get an approximate expression for the integral; we 
shall now apply such a method to evaluate Sy(f). 

In applying this method to the evaluation of an integral with a 
real variable of integration, we must first be able to regard the inte- 
gral as a contour integral along the real axis of the complex plane, 
with an analytic integrand. We note that the integrand in equation 
(22) is an analytic function of p, and that it has no singularities in 
the finite part of the complex plane (defined by p). From Cauchy’s 
theorem it therefore follows that the contour of integration can be 
arbitrarily deformed as long as one end is at p = — o + j0 and the 
other at p = o + j0.12 

In making use of the method of steepest descent the contour must 
be deformed so that the phase of the integrand remains constant (or 
almost so), while the magnitude of the integrand is small except in 
one or more localized regions, where it varies rapidly. This is usually 
accomplished by deforming the contour so that it goes through one 
or more saddle points. In other cases there may be some additional 
constraints on the contour;’ then only a portion of the path of steep- 


* Re z and Im z denote respectively the real and imaginary parts of complex 
number 2. 
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est descent through a saddle point may be used in finding the integral, 
and the deformed contour may not actually reach the saddle point. 
In this case the original integral is usually reduced to a virtually real 
integral whose integrand behaves sufficiently simply on the modified 
path of integration so that an approximate evaluation of the integral 
with rigorous (upper and lower) bounds on the error may be obtained. 

Departures from the strict method of steepest descent occur in this 
paper in that approximate paths of steepest descent are chosen. Al- 
though not quite optimum, they are analytically tractable and serve 
to give useful bounds on the integral under consideration. 

Consider equation (22). Since the integrand in equation (22) is 
an analytic function of 7, let us assume that p = x + jy is a complex 
variable, and let us deform the contour so as to obtain the path of 
steepest descent. Since the integrand behaves properly on the contour 
for large |p|, it is clear that the contour of integration can be de- 
formed in quite a general way in the complex p-plane without modify- 
ing the value of the integral. 

From equation (22) it can be shown that the major portion of the 
integrand 


R(p) = exp [-N"] xp | N° 222 | xp frp (24) 


has a saddle point on the imaginary axis, with the path of steepest 
descent parallel to the real axis at this point. The location p, = jy, of 
this saddle point is given by 





cosh y, sinh y, ON 
a an ae Ge oe 


for a given f/N?W, equation (25) can be solved numerically to give 
the required y,. We plot ys as a function of f/N*W in Fig. 4, and In 
R(jys) in Fig. 5. 

Let us now deform the contour so that it passes through the point 
DP; = jy. and is parallel to the real axis at this point. From equation 
(22) we then have 


S(f) = exp | —20°( cost e = sah.) | Rel (26) 


where 
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I = exp | -w sinh Ys us| 
Ys 


as 2 sin (x + ive _ | 2 Sin (@ ++ iu. |\ 
he {en [wv a+ jy. ao iY 

-exp [jAr] dx. (27) 
Rewriting equation (27) 


r= [° Ge,y) ae, (28) 


where 


G(x, y.) = exp [—Qr(@, y-)] exp [JQr(z, y.)] 
_ __ 72 sinh us] { 2 sin (¢ + iu 
exp | N va 1+ N eo a i exp (jA2), (29) 
where Qr(x, ys) and Qr(z, ys) are real and 
_ anlsinhy, _ E (@ + iu | 
Q2(2, Ys) wy Ye Re x +. Ys (30) 


10 














MATION 











Fig. 4— Location of saddle-point Ys. 
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In R(jYs) 
2N2 























Fig. 5— Value of —[In R(jy,)}/(2N?). 





or 
1—-—v sin x 
tanh y, =z a 1 — cosz 
_ y72Sinh y, 2 Ys n° 
Or(x, Ys) a N Ys x 1 +. (x/y.)° (31) 
and 
QAx, y.) = N? Im je bel + dz, (32) 
or 
Q(z, Ys) 


{einh y, COSZ — y, cosh = 
2 zx am | 


cosh y, sinh LJ 
= N Se ee See , 
| a” + Ys Ys Ys aS3) 


The functions Qz(z, y,) and Q;(z, y,) have been plotted in Fig. 6 for 
a set of values of y, . 


Since we are primarily interested in the high-index case let us assume 
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that N* >> 1. From equations (27)-(83) we now observe that the 
principal contribution to the integral J comes from small xz. For small z, 

G(a, y.) © exp [—Qz(a, y,)]. (34) 


It can be shown (see Figs. 6, 7, and Appendix A) that Qr(z, y,) is a 
monotonically increasing function of x for 0 S$ x S 7 and that it oscil- 
lates for values of x > w. For large 2, 


10 





















































Fig. 6— Functions Qp(z, ys) and Q,(a, ys). 
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Fig. 7— Function Qp(z, ys). From this figure, it can be seen that Qpr(z, ys) isa 
monotonically increasing function of x forO S$ x Sz. 


G(x, y.) & exp | -¥" le us| 


{exp E cosh ¥, sin exp a sinh y, == | 


fe E + N° cosh Y,——~ + jsinh y, 2) Nem (35) 


wv x 











and we note that the first and second terms both have small and 
almost equal magnitude, and almost opposite phase angle, so that 
they almost cancel. As |x| — o the cancellation becomes exact. For 
these reasons it is convenient to divide the range of integration in 
equation (27) into at least four regions: 


O0<|x|<2,,  small|z|, 


| ae ee intermediate | x |, 
r<|z2|_<m, intermediate | z |, (36) 
te < |r| < o, large | x |. 


From equations (25), (30), and (32) we can show that* 


, * These expressions for Qr and Qr may be obtained from the Taylor series 
expansion of the function [sin (x + jys)]/(a + jys). 
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2¢ 


_ 2° = _4\é-1 2 
Qu(t, ys) = N* De (—1) Ane Gp» (37) 
and 
(ewe y Cy aie (38) 
I ’ 8 = 2f4+1 (2¢ + 1)! ’ 
where 
_ sinh y, 
Ay = as ; (39) 
_ coshy, sinhy, A 
A, = aos yp NR? (40) 
Ay = SO op Beep 1,2 Be, (41) 
Ys Ys 
and 
Aven = SEEM ee, £1,258, aR) 
It can also be shown that 
0 1 aide 
Ana = a. Lope? PS eee (43) 
and 
) 2¢ 
Beg a te b=1,2,3,-++. (44) 


490 + 2k + 1 (201? 
Since the spectrum is an even function of f we can assume without 
loss of generality that 
y, = 0. (45) 


For y, 2 0, from equations (43) and (44) all A,’s are monotonically 
increasing functions of y, , and we can further show that 

0< Asr+1) < Ax ’ k= 0, i, 2, eae (46) 
and 

OS Ager S Aver, = 1,2,3,---. (47) 


For large y, (for large f/W), it can also be proved that 


Age © Agen & ee (48) 
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Since it appears that the main contribution to the integral J comes 
from the region of small |x|, assume that | 2, | is small and that 


= I, + Ip ? (49) 
where 
n=] Gey) de, (50) 
and 
In= [Gey de + [ GC, a) de. (51) 
For small |x|, we have from equations (37) and (88) 
Qr(x) & N7LA.2’, (52) 
and 
Q:(a) & N°LA,2°. (53) 


Let us choose* x, so that exp (—3N’A,2”") falls to exp (—5) & 0.0067 
for « = 2, ,f or that 


10 \} 
ais: Con en) 
Since it can be shown from equations (39)—(44) that the minimum 
value of Ag is $ (at y, = 0), 


30\ 
vy < (3) F (55) 
Assume that 
Hay < Ty; (56) 


so that exp {—Qr(a, y.)} is a monotonically decreasing function of x for 
0S xz S x, . Equation (56) will be satisfied for all y, if 


n? => 2 ~ 3.039. (57) 
Tv 


Since we are primarily interested in the high-index case, this is not 
a significant restriction. 
* See equation (34). 


t While there is some degree of arbitrariness in such choices, the bounds 
obtained are not too sensitive to small variations in the value of 21, x2, and so on. 
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Noticing that 














sina < 2, OS2< ~, (58) 
sin @ + iv) | _ in’ 2 sinh yt < (Limb) 69) 
t+ Ys (x® + 3)? ~ ety; 
Since (sinh y,)/(y,) = 1, it can be proved from equation (59) that 
sin (« + jy.) | — sinh y. (60) 
ey ~ Ys 
We can show from equations (29), (50), and (60) that 
[1 <2] fexp [-Ge@, wl + H@,y)} de, (61) 
where 
A(x, ys) = exp | —wesimhe [li + N’ sinh. |, (62) 


From equation (37) 


! 1 1 6! A 
Qr(z, Y.) = wih Ax = 4 A,x* + aI Aco'| oe sia | 


2 “| _ 10! Ar» ‘| = \, 
gfe Le lA ee) 
From equations (46) and (63) it can be shown that for all y, 


O4e4) = w|2 Ane si Ace | . O<2S5m™ < (56). (64) 


We then have 


[fils a exp {-w|4 A,v” — 7 A.at |b 2 / H(x, y,) dz. 
0 0 





(65) 
One can show that 
e* — 1 
esl+— 34 Os tZR (66) 
Since we have 
NN’, 4 < 254A, 1 
085, 4° 354,’ Of75n, (67) 
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exp | 25 de 1 ] 1 I 
; ‘ 2 —— 

i Wickes | sof” exp [— —N*}A,o')}1 + SO 42 Ask — atl a 
1 


42 if “eave 29 a exp [—N?2Ao2"] 
0 0 








| ss | 5 4 Ag-54 .|- i 
2 ie} 
+2 aes / z* exp [~N*2A.2"] dx 
x 0 
+e" H(x,y) de. (68) 
From equation (68) it can be shown that 
lil < () a + (69) 
1 NAS 13» 
where 
ental 4 
2, = fi {eo|# Nia | 


A 
4 . . 
Ee 2(°) exp | -1" sinh wl + N? sinh y I. (70) 
Since we know that 


—|p| Ss Rep s|p|, any arbitrary complex number, (71) 


equation (69) gives an upper bound to Re J,. Let us now find a 
lower bound to Re J. 
From equations (29) and (50) we have 


Rel = 2] exp [—Qnle, y.)] cos (Q,(@, v.)] dx 
z * op | —y? Sinus || ere al 
2Re | exp | N a: + px |} 1+N eae 


(72) 
As shown earlier in this paper 


ie 2 ao sinh y, | 2sin (x + jy, | a 
2Re | exp | - N + pAx PS ea 


< 2m exp alae sinh ue Hy + Nt sinh, (73) 
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One can also show that for z real 


2 
cosz21—F, —~a <z2< ow, (74) 


Using equation (74) we can write 


2f Sebi Gaus eew 10 eal ae 


= 2 [ exp [—Q:(z, me = Cie. vd] dz. (75) 


Now from equations (37), and (38) we have 
1 1 4 Ag 2 
Qr(x, ys) = wi} Ax” — rt Aut (1 ia,? 2’) 
Aio 
A 


4! 
6! 
1 8! 2 
~ 2 ae(1 - BT Ae a (76) 


8! 
1 1 Bt A 
Qa, Ys) = wf} A;x° = Bl Asx 1 _ 7 4. a} 


14 of, 91 Ans =o 
1 Ay! ot Aus aah (77) 


From equations (76), and (77) one can show that 


and 


Qr(z, Ys) S 4N’A22", 0O<2esm < (30), (78) 


and 
Q(x, y:) S 4N7A32*, OS2S 2, < (42). (79) 


- Equations (75), (78), and (79) yield 


IIA 


2 [exp [—Qele, we] c0s [@(, y.)} dx 
> 2 ie exp [—Iv* Aa 1 — x Aja | dx 
= 2] exp [-3N*A"] de — 2 | exp [—3N’A.2"] dx 


4 21 
—- A; / x exp [—4N’A.2"] dz. (80) 
0 
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Notice that 


| x exp [—3N’ A227] dx < [ x exp [—3N’A.2’] dx 
0 0 





1/2r \! 15 
ane 2 (27) (N?A,)* (81) 
We can also show that 


ce) 2.4 
/ exp (—p’?’) dt < ae) , ap #0. (82) 
a? 2a°p 
We can, therefore, write 
‘i 2 Ns 
—1N?2 4.42 _ 4) o-s 
2 [ exp [—3N°A.2°] dx < (7) e. (83) 


From equations (72)—(73), (75), (80)—(83) it can now be shown 
that 


2r_\} j 
Re J, > (22-) [1 — £7], (84) 


where 
| ee 51f/A,\? 1 (3) 
Poze 5 LEN [ees aed 
oe (57)? 24 (42) N’Az 2 T 


“exp | -"" smh SiN sinh. (85) 


We shall now obtain upper and lower bounds to Re Jz in equation 
(51). According to equation (36) let 





Ir = Ts + La (86) 
where 
b= | Ge,ydx+ f Gey) ar, (87) 
and 
I, = ‘a G(x, ys) dx + i G(x, y,) ax. (88) 


From equation (87) we have 


nl s2f | Gte,y) | ae. 9) 
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Now it can be shown from equations (29) and (60) that 
| G(x, Ys) | = exp [—Qr(z, Ys)] 


+ exp | -1" sinh | 3? sinh ue]. (90) 


From equations (89) and (90) we can write 
[To] 32] exp [-Qele, yl ae 


+ 2(r — 2) exp | -w" sinh | y +N? siohy |, (91) 


From equation (64) 


Qp(z, Ys) = N° U Ax” a = Ax" = N’v, UM s x = us (92) 
2 24 
where 
v= 5 Asa” — xi A,z". | (98) 


It can be shown (see Fig. 8) that v and dv/dz are positive forz, SxS 
tm = (6)3(A,/A,)? = (6)' and that dv/dz is a monotonically increasing 
function of x for x, S x S x, where 


, = vi (42) = vi. (04) 


We also know that Qz(a, y,) is a monotonically increasing function of x 
for z, S zx S w. Let us now assume that 7, < V2. 





x V2 Xn Xm 


d 
Fig. 8— Functions v and ae appearing in equation (95). 
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Equations (92)-(94) therefore yield 


2 ig exp (—Q2(z, y.)] dx 








29 if exp [—N%] o ay + 2¢r — V2) exp [—Qz(v2, y.)], (95) 
where 
_~5[,_54e_1 | 
V1 — N? E 6 A, N’A, ’ (96) 
ne ee (97) 
6 
and 
SS — (98) 
Since we know that 
de 1 1 Nx, 
0s —=>s = OR] 99 
= Wy ed 10] — As | (99) 
dx dz l\inz, Bae NAG 


we can write 


2 / exp [—N’v] x ay < f:—54 i exp (—N’v) dv 


< 
sf = 


Since it can be shown that 


is exp [—p*a"] 
i exp [—p’é] di = —*+_,+—_ , 








from equations (91), (95) and (100), we have 


[Iz] < 





N’x, - 2 
exp (—N°v) dv. 100 
5A, 1 oe ( ) (100) 
3 A, N’A, 
2 0, 101 
- p (101) 
2 3 
) (102) 
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where 
B= Gt — pa | Bw) | 
3 A, N’Az 
+ 2@r — v2 a pee 


x, < v2. (103) 


Similarly, if z; > +/2, one can show that 
oA 4 

9 ) {exp [—Qr(a1 ’ Yys)] 
vie 


E, = 2(r = o)(¥ 
+ exp | -1" sahue | oT sinh}. (104) 


& 





Let us now consider the range of integration <2 < ©. Forxz>y, 
? 
exp [—Qr(z, Ys) = jQx(x, Ys)] 
exp | —(a" Smile Ys _ N? cosh y, on sin) 


8 





+ i(re +N’ sinh y, = cose) (105) 
and 
exp | - N? = oe 4 ine | 
1 +N? ron] exp ie N? = ae 4 ine | 
{1 + w[ ene cosh y, -+ j ~~ sinh < (106) 


Let us choose the point x, + jy, along the path of integration so 
that the amplitudes of the two terms in equations (105) and (106) 
differ by less than 10.5 percent [(N’ cosh y,)/xe S 0.1Jand their relative 
angle departs from 180° by less than 0.1 radian [(N’ sinh y,)/z2 S 0.1]. 
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Such a point 2. + jy, is given by 
L2 = 10N’ cosh y, . (107) 


We assume that 2, > + = 2, , or that 


NS a ~ 0.31416. (108) 


Since from equation (57) N? = 30/z’, this inequality is always satisfied. 
We shall now write 


ean, | (109) 
where 
pe [- G64) te i Gla, yd ae, (110) 
gad | 
io = i G(x, y.) de + iz G(x, y.) dx. (111) 


Noticing that 
| sin (x + jy.) | — _coshy, — coshy, 
: en A 112 
[e-Pgye| “ey 2 oY) 
we can show that 


|I,| 2 [exp [-Qn(e, v1 da 


i In (=) 
+ 2(x2 — m) exp | -1" ina + N’ cosh y, ne) , (118) 


2 7 
and 


2tr 


[exp (On, uolde s Df exp [-@ale, w)] ae 


2f-1) 4 
K 


(2441) 7 
+ Df exp [-@n@, dx, (4) 


l= 


where K is an integer such that 


(2K + 1)t > zt = (2K — I)z. (115) - 
One can show that equation (115) is satisfied if 
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ies iwe| 4 | 
Qr 





2 
or 
2 
qs rwp| SN" cosh ve ea i. (116) 
T 2 
We now have 
pe mt tn te 
(ay +y ae ty atl ty’ 
nm SxS (n+ 1)r, n=1,2,3,:-:, (117) 
and 
Ys sins ol — cost. 
tanh y, «x T Ys x 21, 
(26— 1)" S$ x S 2x, 61, 25:3, 88s (118) 
From equations (31), (117), and (118) we can now prove that 
K 2tx 
Dee exp [—Qz(z, y.)] dx 
t=1 (24-1) 4 
K . 
eS 
4=1 Ys 1 + Ys ne 
a (2¢ — 1)’ 
. K 2 . 
= 7 exp | —w 7 a us| > exp a Sete See Z| 
8 t=1 2 gs 
(26— 1) +73 
(119) 


Further it can be shown that* 


. N’y, sinh y, 1 | 
2, | wo (= 1) + oth 


N’y, sinh y, 1 
< op | seh | 





N*y, sinh y, 1 
+ (K a 1) exp | Neue sinh ssi] (120) 


*The upper bound derived in equation (120) can be improved in various 
ways. Since this makes only a minor contribution to the total integral we shall 
be satisfied with this simple bound. 
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From equations (119) and (120) we have 


Sf, exp [-@ale, 1 ae 


t=1 (20-1) 


_ 72 Sinh y, N’y, sinh y, 1 | 
re | a? Kew | mo 1+ @,/n) 


*y, sinh y, 1 
+00 — 0 om [ge alt 


For 2ér + 7/2 S x S (26+ 1)a, ¢ = 1, we can show that 








2 
2) = mat, ape? + ope a — COs 2) 
= J (2). 
In this range of x 
cosz < 0, 
sin x = 0, 
and 
ade Us 





2 ssinz = 0. 


Ys 
cos x + Ql+ D's 


2 
dx —S—étacrmnh , (40 -4+ Dr 


We, therefore, have 








_ _ Y. sin x 21 — cosz 
: tanh y, =z T Ys x” 
=J(x)2 14 2tx af z) = Vy), 
where 


2 





a fa cas A 
Vay.) = 1 tanh y, (4€ + 1) + (26 + 1)*x” 


It can be shown that 


2 


"OO = 1 GED 


(121) 


(122) 


(123) 
(124) 


(125) 


(126) 


(127) 


(128) 
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that V;,(y,) reaches its minimum at 


40 


and 





[Velys)|min 1 — AE tare | >0, ¢€21. (180) 


Now for 2ém# S x S 2¢a + 7/2, it can also be shown that 














_ Us sin x 21 — cosz Vs sin x 
tanh y, x Ts x” =! tanh y, 2r 
Sp oe i Bee ey AAS 
T @& + 1)*r” rene 
One can prove that Z,(x) reaches its minimum at 
7 .[4¢+1" | 
x = 2tr + tan [ee cena (132) 
and 
a = 4y, 
[Li(X) Jin = Uys) = vp 1)" 2 
16y: ys | 
cL + 1)*r* fe Al’r” tanh’ y, (138) 
Next we can show that 
2 
U{0) = 1—- - 5 aa Gia ihe = V (0), (134) 


that U,(y,) is a monotonically decreasing function of y, (see Fig. 9), and 


: aa ee (44 + 1) 
is Uy.) = 1 3977 


It can also be proved by numerical methods (see Fig. 9) that 


> 0, £21. (135) 


Uys) 2 Uilys),y¥ 290, €21, (136) 


and 


Vilys) > Uilys), Ye C21, (137) 


IV 
S 
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We therefore conclude that 


_ _y. sins 21 — cosz Be _ Ay, 
tanh y, x TY: x = Uy.) = 1+ 253° 


16y? y. ) ss a 
| ie Ar? tanh? Ys ’ {2 1, Y= 0, 














2r = x 
Equations (31), (117) , and (138) show that 
(y,) (26)°x” 
(20)'a + yt’ 
2lr Su S (26+ 1)r. (139) 


IIA 


(2¢ + Ir. (138) 
Q(x, ys) 2 N’ _ sane U, 


From equation (139) we can now write 


K (2f+1) 4x 
Sf exp [-Onle, 9] de 
t=1 2ir 
K . 2.2 
_ a2 Sinh y, (26) "3 ] 
< > vis exp | N Ye U,(ys) (20)°x” + Ya 


= 7 exp | -1" il vw» | 


3 exp | Hive sinh x Ui(ys) om dah (140) 
It can be shown that 
Fo [HM U6) oar aay 
< op [THEME Hn a] 
+ (K — 1) exp | Soe sint Ui (ys) aoioel (141) 


Equations (140) and (141) yield 


K (241) 4 i 
Sf exp [-@ele, .)] de <x exp | -wi sah vi» | 
wl v2 8 


2 Y, sinh y, 1 | 
{ow E ro OW) ET ye 


+k = 1) exp & Us MSD 4: Uy.) ae}. (142) 
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From equations (118), (114), (121), and (142) we can write 


Qn \t | 
[Is | < (522-) E;, (143) 
where 
S WA: [ sich | 
Ly = ( on 27 exp | —N * 


N’y, sinh y, 1 ) 
Lex ( rr 1L+(y./m? 


*y, sinh y, 1 
+ OC — 0 exp (EE | 


+ 27 exp | - Sm vw) | 
2 Y, Sinh y, 1 | 
{eo Lv ei) Tyla 


2 Y, sinh y, 1 
a == 1) exp E are ae U,(ys) ers 


in (2) 


+ 2(x2. — 7) exp farsa] + N’ cosh Ye ape )} (144) 


Finally, from equation (111) we have 





[nis2f |@@,v) lac. (145) 


Now from equations (27)—(28) 


| G(a, y.) | = exp | -w" sinh 





Ys 
2 sin (v + iu. _ | 2 sin (& + iu : 
op E a+ is enue Tae ae 
If zis a complex variable, it can be shown that 
jes @ —1-2| < EL exp |e. (147) 


From equations (112), (145)-(147), we can write 
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| I. ] S exp | —1" sinh ue [ys cosh’ y, 


a 1 AS E cosh | a 
nety Le + 


= N* cosh’ y, bas | -1" sinh us] 


8 Ys 
a cosh yo/(ea*-+ye2)! exp (Z) 
. exp) 
ee a7), 
sh’ Yo _ 

Ys 
4 2 j 

S Neos ts ae | -1" shy | 


sin! ys/(tg2+ye?)? 2 
| exp jie cosh sin | dé. (148) 
0 


8 


Since 


O<snés 0, OS¢@Ssin" (149) 


Ys 7 
G+ yi +2: 
we can show from equation (148) that 


4 2 s 
lll < N os acs | —¥" = u.| 


sin? ys/(ra2+ye?)? 2 
| exp eae | dé 
0 


8 


= N’ cosh y, exp | -1" sh u| 


8 


2 cosh y, . ~1 Ye 5 | _ \ ; 
{eo [LW y " @+yyl—! oo 


Now we have 


0<sin’o S xo, 0 


IIA 
IIA 


o 


1. (151) 


NIA 


From equations (149)—(151) , we can write 


| Iz | < N’ cosh y, exp | -" “ae 


, ies cosh Y. | iz \ : 
(ep E N (1O0N® cosh? y, + y 1? < N coshy, 
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ae | = sinha. || ea (z) - | = (2 +) B. (12) 


where 


| (NPAs) 9 Sth. exp (2) — 1] 
KE, = ( on VA: cosh y, ex | N exP | 55 (153) 


From equations (27), (49), (69), (71), (84), (86), (102), (109), 
(143), and (152) we can write the following bounds for Re I: 


(72-) @ - Bt E, ~ E, — Ey} 


N’A 
< Rel < (St 


It has been shown that 





V+ B+ ht B+). (54) 


A, = Shwe 21. (155) 


V. UPPER AND LOWER BOUNDS TO Sy (f) 


We have shown in the sia section that 
Sy(f) = exp (—N san + Se ful OW): Stat = Ww} 


1 Se 2Y. sinh || 
+ SW vol 2N | cost 2 ge Ue (156) 
where 





onc 
(F8-\'t1— Ef — B, — B, — By 
4 2 


<a < (at -)'u +E, +E, +E, +E), (157) 


cosh y, _ sinh y, f 


» ww? ae 
_sinhy, 2 f 
A» =~ Ys Ys N°*-W (155) 
Parameter Equation 
Ey 70 
Et 85 
E, 103. or 104 
Es ; 144 


Ky, 153 


800 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


For N* = 10 and 25 we plot, in Figs. 10-16, Z, , EH’, E,, FE; , and E, . 
Notice that £, , Hi, H, , H; , and EH, appearing in these bounds are 
all very small compared to unity so long as the modulation index is 
moderately high, and that F, , H; , and H, are monotonically decreasing 
functions of f and N’. Also notice that H{ and EH, may first increase 
(see Figs. 11, 12) with y, (or f), reach their maxima and then decrease 
with y, .* It can be shown that these maxima are all very small com- 
pared to unity for all N’ which are even moderately high. 

For all f, we can then write 


(27-) « ~OQ<p< (28-) a + D), (158) 
where | 
C=HM+h+h+h, 7 (159) 
and 
D=LH,+84,+4, +2,. (160) 


From Figs. 10-16 and expressions for C and D, we can show that 
C and D are both small (<2%) compared to unity for N? > 25 and 
for all f. Hence we deduce that . 


4 
ne (WE) asy 
and that the fractional error in this approximation is very much 
less than unity (<2%). 

For N? = 10 and 25 the spectral density Sy(f) and the fractional 
error C and D obtained from equations (158)—(161) are plotted in 
Figs. 17-20. From these figures notice that C and D are less than 
10 percent for N? > 10,7 and that 


Co 29, for iN: = 25, (162) 
D<2%, for N* = 25, (163) 


proving the assertion made earlier in this paper. 
For N? = 6 the spectral density obtained from equations (158), and 
(161) is given in Fig. 21; the percentage error between this spectral 


* One of the terms in Fi’ is independent of f and N’. 

+ By modifying the contour of integration we also have been able to show 
that C and D are less than 8% for N* = 10. Since this modified contour leads to 
unnecessary complications, we have not given that modified contour analysis in 
this paper. As 
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Fig. 12— Parameter #2 as a function of Lae ia 
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Fig. 15— Parameter E, as a function of far z with N? = 10. 
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3 )3 
Fig. 16— Parameter FE, as a function of (sar Z with N? = 25. 


density and that obtained from equation (19) has been plotted in 
Fig. 22 (for a set of values of f/W). The scatter diagram in Fig, 22 
indicates that the spectral densities obtained from the two methods 
agree very closely and that the saddle-point approximation error is 
not related in a simple way to the truncation error (it does not seem 
possible to draw a smooth curve through the points shown in Fig. 22). 

For all practical purposes, including interference calculations, esti- 
mation of the spectrum to such an accuracy is almost always suf- 
ficient. It can therefore be said that the saddle-point approximation 
given by equations (25), (155), (158), and (161) is a good approxima- 
tion to Sy(f) as long as the modulation index is even moderately high 
(N2 > 10). The spectrum can be estimated by this method for all 
values of f even when it is millions of decibels smaller than the con- 
tinuous part of the spectrum at f = 0. 

Now compare the spectrum obtained from the quasistatic approxi- 
mation* to that obtained from saddle-point approximation. For this 
purpose, the spectra obtained from equation (21) for N* = 10 and 25 are 
plotted in Figs. 17, and 19. We see that the spectra obtained from the 
quasistatic approximation agree very closely with those obtained from 
the saddle-point approximation for low frequencies, but that the quasi- 


* See equation (21). 
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Fig. 17 —Spectral density of an angle-modulated wave, with gaussian phase 
ped eaee with a rectangular spectrum. N = (10)? ~ 3.162 radians, rms phase 
eviation. 


static approximation to S,(f) is too small for large f.* In fact for N* = 10 
the quasistatic approximation is 30 dB too small for f/W ~ 13.5. We 
have therefore shown that the quasistatic approximation to the spec- 
trum cannot be used in any interference calculations or in any calcula- 
tions where the behavior of the spectrum on the tails is of importance.f 
The saddle-point approximation can be used at all frequencies as long 
as N’ is moderately high. 

* For small f (or small ys) it can easily be shown that the saddle-point ap- 
proximation reduces to the quasistatic approximation. 


+ The higher the rms phase deviation, the further out will the low-frequency 
(quasistatic) approximation be valid, 
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0.08 











-0.08 
0 





Fig. 18— Bounds on fractional error in saddle-point approximation to the 
spectrum. NV? = 10. 


VI. RESULTS AND CONCLUSIONS 


A simple method (called the saddle-point method) has been pre- 
sented in this paper to estimate the spectrum of a sinusoidal carrier 
phase modulated by gaussian noise having a rectangular power 
spectrum. 

This method gives upper and lower bounds to the spectrum and 
shows that these bounds are very close for all f and for all moderately 
high phase deviations. We also show that the fractional error in the 
saddle-point approximation is less than 2 percent for N? 2 25 and 
for all f. 

The calculation of the spectrum by the saddle-point method is 
rather simple. For a given value of f, N?, and W, we calculate y, 
from equation (25) and A, from equation (155). The spectrum Sy(f) 
is then calculated from equations (156) and (161). 

We have also shown in this paper that the quasistatic approxima- 
tion to Sy(f) is only good at low frequencies, and that for large f the 
results obtained from that approximation are too small. 
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APPENDIX 


It can be shown (see Ref. 7, p. 114) that 





1 c) N? Ww 
Sy(f) = onW i exp |-% i (1 — cos 2rpr) au | 


-exp [—j2zfr] dr, (164) 
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Fig. 19— Spectral density of an angle-modulated wave, with gaussian phase 
modulation with a rectangular spectrum. N = 5 radians, rms phase deviation. 
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Fig. 20— Bounds on fractional error in saddle-point approximation to the 
spectrum. N? = 25. 
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Fig. 21— Spectral density of an angle-modulated wave, with gaussian phase 
pomueaen with a rectangular spectrum. N = (6)”? = 2.449 radians, rms phase 
eviation. 
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ERROR IN PERCENT 





i; 7 
2 
(Re) Ww 


Fig. 22 — Percentage error between the spectral densities obtained from saddle- 
point approximation and that obtained from equation (19). It does not seem 
possible to draw a smooth curve through the points shown in this figure. We 
have, therefore, shown the error as a scatter diagram. 


or 


S(f) = f {exp | -* [- (1 — cos Hp) au 


— exp (—N'|1 + N’ sing |\,p» dp. (165) 





From equation (165), it can be shown that 


I = exp | -1" sinh 
: Ys 


o 2 Ww . 
ff {exp E i: cos a (x + jy.) au | 


= E + N° sin iu) ]} dx, (166) 


and 


Ne eo a moe, 3 
aA _, 008 a7 (@ + jy) du, (167) 





cates yy = wel — ne 


810 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1969 


or 


: 2 
Qr(x, y.) = N” ea *- i cos & x cosh du. (168) 


Us : W Wy 
Equation (168) yields 


2 Ww 
Meet Ys) _ os [ isin 7 x cosh #> W y. d, (169) 
0 
For 0 S ur/W So, 
sn=2 20. (170) 
Ww —_— 
For y, 2 0,and0 Sup s W, 
0<aKr, (171) 
Bil pa Bade 
yin we cosh WY = 0, (172) 
and from (169), 
(QW) 9 OSaer. (173) 


Ox _ 


From equation (173) we then conclude that Qz(xz, y,) is a mono- 
tonically increasing function of x forO:S z Sz. 
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Correction Concerning Reflecting Objects in 
Coherent [lumination 


By L. H. ENLOE 
(Manuscript received January 31, 1969.) 


In a recent paper, the author analyzed and discussed the noise-like 
structure in images of diffuse objects in coherent illumination.* While 
the author’s interest and discussion concerned objects viewed in re- 
flection, a model illustrating a diffuse object was unfortunately shown 
in Fig. 1 as a granular transparency viewed in transmission. It turns 
out that the analysis presented is not sufficiently general to cover 
transparencies viewed in transmission. The object must be viewed in 
reflection because: 


(2) The direct beam, that is, the unscattered component, is not in- 
cluded in the fundamental equation (1). 

(2) The relative phase angles 6; of the individual scatterers in 
equation (1) are not unqualifiedly random in the forward scattering 
direction. 


I would like to thank D. Berkley for bringing this to my attention. 


*Enloe, L. H., “Noise-Like Structure in the Image of Diffusely Reflecting 
Objects in Coherent Ilumination,” B.S.T.J., 46, No. 7 (September 1967), pp. 
1479-1491. 


817 


